For Lois and Ellen


Preface 


The enormous amount of work that has been done on the language of the Jewish Bible is 
still limited in its range. The phonology of Biblical Hebrew (consonants, vowels, accents) and the 
morphology (the phonetic/phonemic patterns and grammatical structures of words) have been thor- 
oughly described. But as one moves into grammar proper, the work becomes more and more selec- 
tive and incomplete. Even at the level of small phrases, there are many phenomena that have not 
been accounted for or even noticed. The familiar constituents of clauses, such as verbs, subjects and 
objects, have been discussed, of course, but many other clause constituents have not received much 
attention from Hebraists, compared with the work that linguists have done with other languages. 
There is not yet a comprehensive, systematic, principled investigation of whole clauses as the basic 
units of syntax. The rigorous exploration of the grammar of discourse structures larger than clauses 
is only in its beginning stages. 

In addition to incomplete coverage of phenomena, another limitation in previous work is its 
concentration on the narrative prose texts of the Hebrew Bible, with less attention to prophetic, 
poetic, and wisdom writings. The challenge is to study the entire text of the Hebrew Bible at every 
level of grammatical organization, from phoneme to discourse, as completely as possible. This task 
has become more feasible with the availability of electronic texts for computer-enabled research, 
assisted by the development of linguistic theories that supply appropriate concepts and terminology. 
This is the burgeoning field of corpus linguistics. 

Compared with other corpora, the evidence for the language used by the people of Israel in 
biblical times is meager. The Hebrew Bible, supplemented by contemporary inscriptions found by 
archaeologists and by extra-canonical compositions, such as the Dead Sea Scrolls, proffers about a 
half million words—the number of words in the Hebrew Bible is a little more than 300,000. Yet a 
comprehensive investigation of the grammar of these few texts has not yet been achieved. 

Corpus linguistics aims at the analysis and description of all the grammatical phenomena in a 
given body of text in some “canon,” such as the complete works of one author. This book takes 
the first steps toward investigation of the grammar of the Hebrew found in just one version of the 
Hebrew Bible. 

Biblical Hebrew Grammar Visualized (BHGV) is an initial foray, trying out the corpus-linguistics
way of doing research on the grammar of Biblical Hebrew. While there is much in the book
that beginning students of Biblical Hebrew will grasp, it is primarily intended for intermediate and 
advanced students of Biblical Hebrew, as well as researchers in the area. Its data consist of phrase 
markers for each and every distinct clause in the Hebrew Bible. Each phrase marker gives a picto- 
rial representation of the constituents of each clause in the form of a structure (a tree or graph) that 
shows the grammatical functions and semantic roles of clause constituents, along with the gram- 
matical relations that bind the constituents into linguistic structures. 

There is virtually no limit to the questions that may be addressed to the data in this form, no 
limit to the amount of detail that the phrase markers can reveal. Understandably, this book cannot
report more than a few small and representative samples of the fruits of the linguistic research that
is enabled by our new tools. Many of the results are presented here in the form of tallies of the in- 
cidence of selected phenomena, often as bar charts. Close-up details are accessible by interrogating 
the full database of phrase markers and texts available from Logos Bible Software. 

This book sets forth the linguistic theory on which the grammatical analysis is based. It explains 
the terms used. It presents a few sample studies that will illustrate the potential of this approach for 
research. It suggests topics for further investigation. 

Some of the material in this book will be new to students of Biblical Hebrew who do not have 
a background in contemporary linguistics. This book will supply some of that background. It does 
not attempt a full-scale discussion of all of the theoretical issues that lurk in its foundations, but it 
points to the main literature that we draw upon. Nor does it seek “the one true grammar for Biblical
Hebrew.” We believe that superior insights emerge when scholars approach the text via many paths.

In spite of the seeming complexity of many clauses (some are quite long and convoluted), most 
clauses are short, and the basic grammatical structures in all of them are actually quite simple. A 
construction consists of constituents in relationships, many of them binary (with two members 
joined together in some way). Because so many structures are small and simple, they are acces- 
sible to students whose understanding of Hebrew grammar may still be in its early stages. There 
are numerous simple searches that they can carry out immediately to find small phrases and short 
clauses. They can narrow the searches by specifying the vocabulary item(s) they wish to be present,
enriching familiar “word” searches beyond mere co-occurrence patterns into the grammatical
constructions in which some word is used. This is just a beginning. More advanced students and 
scholars engaged in cutting-edge research may find the investigation of more complex structures 
more challenging, but all the more rewarding for that. 

It is important to make it clear that this book is not a user manual for the Andersen-Forbes 
data as realized by the Logos software. Our simple web site (Andersen-Forbes.org) contains white 
papers intended to assist would-be searchers of our data plus other useful materials. Nor does the 
book seek to be a reference grammar. 

Because this is an initial foray, there is much in the book that is unfinished and provisional. We 
try to be upfront about this. Addressing real problems has enhanced our repertoire of grammatical 
functions and semantic roles; it has also shown the need for further work. The data will be enriched 
and made more transparent when we have completed the full representation of the grammatical 
functions and semantic roles of all the clause immediate constituents. 

Because discourse grammar and clause syntax are parts of the same fabric of Hebrew grammar, 
we expect the work on clause syntax presented here to move seamlessly into the discourse grammar 
that we hope to present in a future volume. The final chapter of this book offers a foretaste of our 
approach to discourse analysis. 


A word about our glosses: The glosses that we supply throughout this book are included to assist
readers. They are type glosses, not token glosses. That is, one gloss is applied to each form (the
“type”) and is used whenever that form appears in the text (as a “token”). Consequently, the glosses
are jarring at times. They in no way attempt to supply an interlinear translation. (For more on our 
glosses, see Appendix 7, §A7.1.)
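
The difference between type glosses and token glosses can be pictured with a minimal sketch. It is an illustration only, not the Andersen-Forbes implementation: the table `TYPE_GLOSSES`, the placeholder form names, and the function `gloss_tokens` are all hypothetical.

```python
# Hypothetical gloss table: exactly one gloss per form (the "type").
# The form names are placeholders, not real dictionary entries.
TYPE_GLOSSES = {
    "form-1": "house-of",
    "form-2": "man",
}

def gloss_tokens(forms, table=TYPE_GLOSSES):
    """Return the type gloss for every token, or '?' for an unlisted form."""
    return [table.get(form, "?") for form in forms]

# The same form receives the same gloss wherever it occurs in the text,
# regardless of context.
glossed = gloss_tokens(["form-2", "form-1", "form-2"])
print(glossed)
```

Because the lookup ignores context, a gloss that suits one occurrence may read oddly in another, which is why type glosses can be jarring and cannot serve as an interlinear translation.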
The Structure of This Book


Chapters 1-4 provide readers with important introductory information on our approach: the text
adopted, our general linguistic perspective, the syntactic representation used, division of the text 
into clauses and into individual segments, and our part-of-speech system. 

Chapters 5 and 6 provide an extended survey of the basic and complex phrases found in Bibli- 
cal Hebrew. 

Chapters 7 and 8 characterize the clauses and clause-like structures of Biblical Hebrew. We 
examine both main and embedded structures. 

Chapters 9-17 study clause immediate constituents (CICs)—the major constituents that com- 
bine to form clauses—from multiple perspectives. We introduce the five disparate types of CICs 
(Chapter 9) and then focus especially on the semantic roles fulfilled by various CICs (Chapter 10). 
We introduce methods for assessing the composition, incidence, and ordering of CIC types (Chap- 
ter 11). We then use these methods to investigate the characteristics of the clauses based on four 
very frequently attested verb roots (Chapters 12-15). We next examine the makeup of the various
CICs (Chapter 16) and then explain the uses of, and computation of, a way of comparing sets of 
clauses in terms of the incidence patterns of their CICs (Chapter 17). 

Chapters 18-21 tie up some loose ends. We examine the characteristics of quasiverbals (Chap- 
ter 18) and verbless clauses (Chapter 19). We scrutinize phenomena that significantly complicate 
syntactic representation (Chapter 20). Finally, we present supra-clausal structures and introduce the 
rudiments of our approach to discourse analysis (Chapter 21). 

The seven appendixes are designed for specialists. Motivated general readers will be able to 
grasp much of the material but should, with little hesitation, skip over material that seems too tech- 
nical for them. The appendixes deal with details of text preparation (Appendix 1), our approach to 
linguistics (Appendix 2), various ways of approaching positional syntax (Appendix 3), alternation 
of the indirect object form in the WX corpus (Appendix 4), statistically rigorous compositional 
analysis (Appendix 5), pairing verbless clauses (Appendix 6), and enhancements of our database, 
realized and planned (Appendix 7). 

The book concludes with a glossary, bibliography, and three kinds of indexes (authors, scrip- 
ture, and topics). 
Global Abbreviations


Abbreviations used throughout the book are included here; locally-used abbreviations are not. Brief 
definitions of items set in the Eurostile font in the right-hand column may be found in the Glossary. 


A       Aleppo Codex
ASV     American Standard Version
BDB     Brown, F.; Driver, S. R.; Briggs, C. A. A Hebrew and English Lexicon of the Old Testament.
        Oxford: Clarendon, 1907.
BHL     Dotan, A., editor. Biblia Hebraica Leningradensia. Peabody, MA: Hendrickson, 2001.
BHS     Elliger, K., and Rudolph, W., editors. Biblia Hebraica Stuttgartensia. Stuttgart: Deutsche
        Bibelgesellschaft, 1984.
CCC     cophenetic correlation coefficient
CIC     clause immediate constituent (a central concept)
DA      discourse analysis
DU      discourse unit
EDU     elementary discourse unit
GF      grammatical function (a central concept)
GF/SR   grammatical function/semantic role
GKC     Kautzsch, E., editor. Gesenius' Hebrew Grammar. Translated by A. E. Cowley. 2nd ed.
        Oxford: Oxford University Press, 1910.
KJV     King James Version
L       Leningrad Codex
LXX     Greek Septuagint
MT      Masoretic Text
NIV     New International Version
NJPS    New Jewish Publication Society
NP      noun phrase
NRSV    New Revised Standard Version
POS     part(s) of speech (a central concept)
PP      prepositional phrase
QV      quasiverbal
RRG     role and reference grammar
SR      semantic role (a central concept)
VLC     verbless clause
VP      verb phrase


Note: Terms that are defined in the glossary are flagged in the text by setting them in the Eurostile 
font when we believe an assist might be helpful. 
1st                 first in clause
2nd obj             second object

abs                 absolute
abst                abstract
accmp               accompanier
acmpl               adjectival complement
adj                 adjunct
anch                anchored
anml                animal semantics
app                 apposition

                    beneficiary
                    building semantics
                    bonded
                    body part semantics

                    consonant
                    common
                    collective
                    cause
CCC                 cophenetic correlation coefficient
CIC                 clause immediate constituent
                    clause
cmpl                complement
cmpr                comparison
cogv cmpl           cognitive complement
conc                concurrency feature
cond                condition
coord cj            coordinating conjunction
crd                 cardinal number
cstr                construct

D                   dual
d                   distributive
DA                  discourse analysis
def                 definiteness
deprv               deprivation
dir obj             direct object
discrs              discourse
disj                disjunction
dl and / dl seq w   discourse level sequential waw
DRS                 discourse representation structure
dstr app            distributed apposition
dstr obj            distributive object
DU                  discourse unit

EDU                 elementary discourse units
EN                  enriching constituent
excl                exclamative
excl voctv labl     exclamative vocative clause label??
ext                 extent


F/fem.              feminine
furn                furniture semantics

gam                 Heb. word gam ‘also’
geog                geographic semantics
GF                  grammatical function
gram                grammatical knowledge / grammar determined

humn                human semantics

I/ind obj           indirect object
IC                  immediate constituents
incl                includer
inf abs             infinitive absolute
inf abs amp         infinitive absolute amplifier
inf abs int         infinitive absolute intensifier
inf cst             infinitive construct
inf uttr            infinitive of utterance
instr               instrumental
inv                 inverse modification
int                 intensifier
intrg               interrogative

Janus sc/sb         Janus subject-complement/subject
juxt                juxtaposed

laps                lapsus calami
lbl                 label
loc                 locational semantics
loc distributive    distributive location

m/masc.             masculine
mdl                 modal
mix list            mixed list (some items in a list flank conjunctions, and others do not)
MN/mnr              manner
mntl                mental semantics
mod                 modification
mvt aim             movement aim
mvt dir             movement direction
mvt orig            movement origins

N/n                 noun (phrase)
neb                 nebulous
neg                 negative
nested              nested phrase (neither subsetting nor supersetting)
nmlzr               nominalizer
nom                 nominalized construction
nom inf             nominalized infinitive constituent
nota acc.           nota accusativi (definite-object-marker word ʾet in Hebrew)
NP                  noun phrase

O/obj               object
obj addr            object of address
obj cmpl            object complement
obj mk              object marking
oblq                obliqueness
open intrg          open question

P/pl.               plural
part                participle
pdox                paradoxical licensing relation
pers.               person
pers pr             personal pronoun
PN                  proper noun
POS                 part of speech
poss                possessive
PP                  prepositional phrase
PR                  participant
prd                 predicator
prep                preposition
pron                pronoun
PS                  phrase structure
ptc                 participle

q closed            closed question
qtr                 quoter
qty                 semantics of quantity
qual                semantics of quality
QV/qv               quasiverbal

refrnt              referential
rem                 remainder
resum               resumption
rhet                rhetorical question operator
RRG                 role and reference grammar
rslt                result
rsn                 reason
rsrc                resource


rstr                restricter

S/sent              sentence
S/sing              singular
sbj                 subject
sbj addr            subject of address
sbj cmpl            subject complement
sbj distr           distributive subject
sbst                substantive
SCCs                supra-clausal constituents
Sem                 semantics
sent/discrs         sentence licensed in discourse
SP                  spatial
sptl                spatial semantics
SR                  semantic role
subord              subordinate
subset              subsetting phrases (from most general to least general)
suff                suffixation
superset            supersetting phrases (from least general to most general)
susp                suspension

tm                  temporal
tm int              time interval
tm pt               time point

unanch              unanchored
undes               undesirable outcome
utnsl               utensil semantics

V/vb                finite verb (predicator)
val                 valuable
veg                 vegetation semantics
VLC                 verbless clause
vocbl               vocable semantics
voctv               vocative
VP                  verb phrase

war                 war semantics


Chapter 1 


Introduction: 
What We Mean by “Biblical Hebrew,”
“Grammar,” and “Visualized” 


1.1 Biblical Hebrew 


In this book, Biblical Hebrew is the language contained in all of the Hebrew portions of the 
Jewish Bible as attested in the Leningrad Codex (L). 


1.1.1 Choosing the Text(s) 


Biblical Hebrew in our book title reflects the fact that we have limited the corpus here considered
to the Jewish תנ״ך minus its Aramaic portions (see §1.1.3.5). Given that early decision, we
next had to choose which text(s) of the Jewish Bible to rely upon. We could pick one complete 
“best” manuscript, or we could select “best portions” drawn from several manuscripts. That is, we 
could opt for the Leningrad Codex, L (dated 1008 C.E.), essentially complete and judged to be of
high quality—the text transcribed in the Biblia Hebraica Stuttgartensia (BHS). Or we could piece 
together excellent manuscripts to synthesize a complete text: 


* Cairo Codex of the Prophets, C, dated 895 C.E.
* Petersburg Codex of the Latter Prophets, P, dated 916 C.E.
* Aleppo Codex, A, dated ~930 C.E., lacking most of the Torah as well as many Writings
* British Museum Or 4445, B, dated ~950 C.E., consisting of Gen 39:20-Deut 1:33
* Various texts from Qumran


Both options were open since each specifically mentioned manuscript (L, C, P, A, B) was avail- 
able, in that its text existed in photographic, transcribed, or facsimile form. 

Because it was (essentially) complete and enjoyed wide scholarly acceptance, in 1970 we chose 
to base our computer-readable text on the Leningrad Codex. To date, we have seen no compelling 
reason to alter that decision.¹


1.1.2 Correcting “Obvious Errors” in L

We depart from readings provided by L when we judge those readings to be simple scribal 
errors. Such erroneous readings are typically signaled in the apparatus of BHS by a “sic L” note. 


Authors’ note: A grasp of this chapter, especially §1.3, is vital for understanding this book. We suggest readers first go
through the chapter without consulting appendixes. Where more information is desired, consult relevant appendixes dur- 
ing a second pass. Terms in underlined Eurostile font (such as normal) may be found in our glossary. 

1. Readers desiring a more detailed discussion of text choice should read appendix §A1.1.

Across the whole of BHS, we count 368 “sic L” notes. We change what we consider an erroneous
reading 216 times, 59% of the instances.²

In addition to correcting about three-fifths of the “sic L” anomalies flagged in BHS, we have
made three changes that are not signaled in BHS by a “sic L” note.³

We have also restored Josh 21:36-37, as per BHS and Dotan's BHL. This patch is now taken 
for granted, but one may have misgivings as to the quality of the inserted text. The apparatus of 
BHS says "exstat in mlt Mss Edd" but does not disclose the source of the text selected. Works of 
reference, such as those by E. Tov and by Robert G. Boling and G. Ernest Wright, are perfunctory.⁴
Benjamin Kennicott reported the turbulent variety in the readings in the sources.⁵ He was particularly
struck by manuscripts that had four additional Hebrew words in the “missing” text (“to be
a city of refuge for the slayer”⁶) and one manuscript with a fifth extra word (“in the wilderness,”
agreeing with the LXX). These tiny differences in themselves may not seem to matter. But they
raise a serious issue. The mystique of the ben Asher tradition, greatly exaggerated by Kahle, and
the consequent prestige of the successive editions of Biblia Hebraica do not warrant acceptance of
L as “all we need to know about Biblical Hebrew.” In due time, the variants already available in the
Qere / Kethiv apparatus should be augmented from sources with a claim to be considered “Biblical
Hebrew,” particularly the biblical texts among the Dead Sea Scrolls.

Early on, we decided that we would not attempt to make emendations. Of course, there are im- 
portant places where a particular emendation makes excellent sense and rescues the analysis from 
paralysis. But, in the interest of consistency, we have omitted even these well-justified emenda- 
tions. The costs paid for our determined avoidance of emendations are rather far-fetched analyses 
of some damaged passages. 


1.1.3 Reducing the Text 


Because we knew that production of a syntactic representation of Biblical Hebrew would be a 
massive undertaking, we decided to limit our initial analysis to a single unequivocal text derived 
from our chosen manuscript by applying to our minimally corrected text of L the following five 
sorts of reductions: 


* Omission of cantillations 

* Choice of Kethiv readings, setting aside the Qere variants 
* Resolution of lexical ambiguity 

* Resolution of structural ambiguity 

* Omission of Aramaic text blocks 


2. For additional discussion, see appendix §A1.2.1 and §A1.2.2.

3. For the specifics, see appendix §A1.2.3. 

4. Emanuel Tov, Textual Criticism of the Hebrew Bible (2nd ed.; Minneapolis: Fortress, 2001) 238. Robert G. Boling
and G. Ernest Wright, Joshua (AB 6; Garden City, NY: Doubleday, 1995) 483.

5. Benjamin Kennicott, The State of the Printed Hebrew Text of the Old Testament Considered (Dissertation I, 
Oxford, 1753: 440, 552; Dissertation II, Oxford, 1759: 285, 330-32, 390, 485, 487, 571). 

6. Ibid., 1.440, II.332. 

7. Ibid., II.332.

1.1.3.1 Omission of Cantillations

We used cantillations in resolving certain homographic normal and construct forms and in 
resolving the ambiguity of words that differed only in stress position. But, given the turbulence ev- 
ident in the marking of cantillations and given the limited technical resources available to us early 
on, we elected not to include cantillations in our text markup.⁸

1.1.3.2 Kethiv Text Analyzed

Our computer text includes both the Qere and Kethiv readings found in L, the Kethivs having 
been vocalized in accordance with Gordis.⁹

The presence of the Qere / Kethiv alternate readings presents the corpus grammarian with three 
options. The very fact that the sages included Qere / Kethiv pairs in the text makes them part of the
text and also betokens an attitude of neutrality over their respective merits. It cannot be inferred that 
placing the Kethiv in the body of the text gives it a superior claim, especially when we notice that 
Qere and Kethiv sometimes swap places. 


1. First option. Accept all the data on a level field and make two representations of each 
clause that contains a Qere / Kethiv pair of words that differ in syntax. This procedure is the
same in principle as multiple representations of structurally ambiguous clauses (§1.1.3.4). 

2. Second option. Choose the better of the two, case by case. This would permit palpably 
recalcitrant variants to be discarded. However, adopting this policy would involve tricky 
choices, tantamount to deciding on a single text. This could not be done responsibly 
without spelling out the reasons for each choice, an onerous task that is best left to 
specialized Qere / Kethiv researchers.¹⁰

3. Third option. As an interim measure, choose either all of the Qere or all of the Kethiv 
variants. 


To achieve maximal simplicity in representing syntactic structures, we decided to use all of the 
Kethiv variants, without prejudice relative to the merits of the Qere alternatives. A price is paid for 
this choice. We are left with impossible Kethivs that must be parsed as nebulous (§9.3.1.2). 
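
The third option amounts to a single, uniform selection rule applied across the whole text. The following sketch illustrates that rule under stated assumptions; the `Token` class, the `select_readings` function, and the placeholder readings are hypothetical and do not reproduce the authors' database format.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class Token:
    """One word position in the text (hypothetical structure)."""
    kethiv: str                 # the written reading
    qere: Optional[str] = None  # the alternate reading, if the position has one

def select_readings(tokens, policy="kethiv"):
    """Apply one uniform policy: all Kethiv variants or all Qere variants."""
    if policy not in ("kethiv", "qere"):
        raise ValueError("policy must be 'kethiv' or 'qere'")
    chosen = []
    for token in tokens:
        if policy == "qere" and token.qere is not None:
            chosen.append(token.qere)
        else:
            chosen.append(token.kethiv)
    return chosen

# Placeholder readings, not real data: only the second position has a variant pair.
verse = [Token("w1"), Token("K2", qere="Q2"), Token("w3")]
print(select_readings(verse))          # the all-Kethiv policy adopted in this book
print(select_readings(verse, "qere"))  # the alternative uniform policy
```

A uniform policy needs no case-by-case justification, which is what makes it usable as an interim measure; the cost is that an unparsable Kethiv is kept even where the Qere would parse cleanly.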

The first option might be followed later. We emphasize that, because of the shortcomings of L's 
Qere / Kethiv repertoire, our electronic text cannot contribute much to the study of the Qere / Kethiv
phenomenon.¹¹


1.1.3.3 Lexical Ambiguity Resolved 

In line with our decision to seek maximal representational simplicity initially, we have resolved 
lexical ambiguity. A given word may exhibit lexical ambiguity. That is, it may be one of several 
possible lexemes that share the same spelling. Words of this sort are homographs, “lexemes . . .
which have the same spelling but differ in meaning.”¹² Of 44,026 entries in our dictionary, at present
6,546 involve homography. Failure to resolve homography where a grapheme appears as multiple
parts of speech represents an error on our part. Resolution of within-part-of-speech homography is
an ongoing process carried out when we notice that a particular gloss is misleading at times.¹³


8. For more discussion, see appendix §A1.3.1. 
9. Robert Gordis, The Biblical Text in the Making: A Study of the Kethib-Qere (Jersey City, NJ: Ktav, 1971). 
10. Maimon Cohen, The Kethib and Qeri System in the Biblical Text (Jerusalem: Magnes, 2007). 
11. Readers desiring more detail should consult appendix $A1.3.2. 
12. David Crystal, A Dictionary of Linguistics and Phonetics (5th ed.; Oxford: Blackwell, 2003) 220. 
13. For specific examples and further discussion, see appendix §A1.3.3.

In specifying our text, our policy has been to resolve many instances of lexical ambiguity. At
times, this has forced us to resolve fundamental, even intentional ambiguity by over-reader fiat. 
This policy should not be allowed to negate the claims of other readings. 


1.1.3.4 Structural Ambiguity Not Represented Initially 
Consider this structurally ambiguous clause from Exod 2:1: 


(1) וַיֵּלֶךְ אִישׁ מִבֵּית לֵוִי
and + he-walked + man + from + house-of + Levi 


The prepositional phrase (from house-of Levi) can: 


1. modify man (a man from the house of Levi walked) [house = tribe], 
or 
2. specify the movement origin (a man walked from the house of Levi) [house = building]. 


Since we have elected for the present to represent only one parse of a structurally ambiguous text, 
we must decide which of the interpretations is the more salient. In the present instance, we draw 
on world knowledge to the effect that “house-of Levi” refers to a kinship group and decide that the
first option is the preferred one. 

The structural ambiguity in the example is easily resolved, but resolution is not always so easy. 
As is explained in §20.3, we have made provision for the representation of ambiguity in the future. 
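
The two readings of example (1) can be written out as two small trees over the same words. This is a sketch only: the `Node` class and the tree shapes are illustrative assumptions, with labels borrowed from the abbreviation list, and they are not the phrase markers actually stored in the database.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    label: str                                     # grammatical label
    gloss: str = ""                                # English gloss, for leaves only
    children: list = field(default_factory=list)   # empty for leaves

def leaves(node):
    """Collect the leaf glosses from left to right."""
    if not node.children:
        return [node.gloss]
    found = []
    for child in node.children:
        found.extend(leaves(child))
    return found

def prep_phrase():
    """Build the phrase 'from house-of Levi'."""
    return Node("PP", children=[
        Node("prep", "from"), Node("N", "house-of"), Node("N", "Levi"),
    ])

# Reading 1: the prepositional phrase modifies "man" inside the subject.
parse_modifier = Node("clause", children=[
    Node("coord cj", "and"),
    Node("V", "he-walked"),
    Node("sbj", children=[Node("N", "man"), prep_phrase()]),
])

# Reading 2: the prepositional phrase is a separate movement-origin constituent.
parse_origin = Node("clause", children=[
    Node("coord cj", "and"),
    Node("V", "he-walked"),
    Node("sbj", children=[Node("N", "man")]),
    Node("mvt orig", children=[prep_phrase()]),
])

def top_level_labels(clause):
    """List the labels of the constituents directly under the clause node."""
    return [child.label for child in clause.children]

print(top_level_labels(parse_modifier))
print(top_level_labels(parse_origin))
```

Both trees span the identical word sequence; they differ only in where the prepositional phrase attaches, which is exactly the choice that has to be made when a single parse is recorded.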


1.1.3.5 Exclusion of Aramaic Blocks 

Our database includes the text and syntactic analysis for all of the תנ״ך. Hence, users of our
database have three language options: Hebrew only, Aramaic only, or the entire תנ״ך.

This book is based, however, on only the Hebrew chapters and verses. Our counts and ex- 
amples are drawn from the Hebrew portions. We omit the following blocks of Aramaic text: Jer 
10:11; Dan 2:4b-49 and chaps. 3, 4, 5, 6, and 7; Ezra 4:8-24, chap. 5, 6:1-18, and 7:12-26. The 
text for these passages contains 4,827 words. Hence, they account for around 1.6% of the census of 
words in the complete תנ״ך, which consists of 305,520 words.
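
The proportion quoted above follows directly from the two word counts, as this short check shows. The variable names are ours; the two figures are the ones given in the text.

```python
TOTAL_WORDS = 305_520    # words in the complete text, as stated above
ARAMAIC_WORDS = 4_827    # words in the omitted Aramaic blocks, as stated above

hebrew_words = TOTAL_WORDS - ARAMAIC_WORDS
aramaic_share = ARAMAIC_WORDS / TOTAL_WORDS

print(hebrew_words)             # 300693
print(f"{aramaic_share:.1%}")   # 1.6%
```

The Hebrew-only remainder of 300,693 words agrees with the Preface's figure of "a little more than 300,000" words.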


1.2 Grammar 


Readers may be assisted by knowing from the outset our stance regarding linguistic theory. In 
setting out the way our work relates to the Hebrew grammar that is enshrined in the grammars and 
other works of reference that most students of Biblical Hebrew use, we do not debate the merits of 
our different ways of doing things, nor do we go into much detail. In this introductory section, we 
emphasize three critical grammar-related points: 


1. We accept and affirm and continue to use as much as we can of previous work on Hebrew
grammar. 

2. We are eclectics, drawing insights from multiple linguistic formalisms. 

3. Too often, statements alleged to be about the syntax of Biblical Hebrew are, in fact, 
statements about the syntax of translation(s). We try to be very sensitive to this trap, 
actively seeking to avoid it.

1.2.1 Traditional Approaches to the Syntax of Biblical Hebrew

For the most part, what traditional Hebrew grammar calls syntax is restricted to the short-range 
functions of the various word classes. Our syntax begins where traditional treatments leave off. The 
two approaches are complementary, and most of what traditional Hebrew grammar has done can 
stand, especially the extensive treatments of morphology.¹⁴ Taking the “word” as its working unit,
traditional Hebrew grammar is rather fragmented. By contrast, our treatment aims to be holistic. 
Our major working units are whole clauses. 

Behind the old mindset was a belief that language was logical, that grammar was consistent,
and that Biblical Hebrew grammar was uniform and homogeneous. This was the underlying view, 
whether one’s approach rested on academic scientific rationalism or on the pious belief that sacred 
Scripture had the unique benefit of divine authorship and therefore was immaculate, its inerrancy 
preserved by divine providence. 

In contradiction to all that, and without any detriment to the qualities of a holy book, we are 
committed simply to describing what is there, recognizing variety and diversity, and seeking, some- 
day perhaps, to correlate variety with the numerous variables that likely were at work—date, text 
type, dialect, register, and so on—with corruption appearing as the last option, reluctantly invoked, 
when all other explanations fail to convince.¹⁵


1.2.2 Eclecticism and Modern Linguistics 


We have relied on many of the insights and methods developed by modern linguists. Rather 
than pausing along the way to explain these influences or expecting readers to infer them, we will 
briefly sketch the major influences here. Our basic stance with regard to modern linguistics is as 
follows: 


1. Our emphases are those of the corpus linguists rather than those of the transformationalists. 
2. We are, nonetheless, generativists with a preference for phrase-structure grammars. 


1.2.2.1 Emphases Drawn from Modern Linguistics 

For us, many of the emphases in renascent corpus linguistics—the linguistic analyses of texts— 
are superior to those insisted upon by the Chomskyans in the latter decades of the last century. 
Specifically, for our work, we affirm the priority of:¹⁶


* Empiricism over rationalism 

* Performance over competence 

* Language description over language universals 
* Quantitative models over qualitative models 

* Surface structure over deep structure

* Functional approaches over formal approaches 


14. Joshua Blau, Phonology and Morphology of Biblical Hebrew: An Introduction (Linguistic Studies in Ancient 
West Semitic 2; Winona Lake, IN: Eisenbrauns, 2010). 

15. More detail on traditional approaches to the grammar of Biblical Hebrew can be found in appendix §A2.1.

16. The first four contrasts are adapted from Geoffrey Leech, “Corpora and Theories of Linguistic Performance,” 
in Directions in Corpus Linguistics (ed. Jan Svartvik; Berlin: De Gruyter, 1992) 107.

The fiats of the Chomskyans are intelligible once one understands that their goal was the demoli-
tion of American structuralism. Since Chomsky and his disciples saw structuralism as misguided 
in its reliance on empirical methods, they strongly preferred rationalist approaches based on native- 
speaker intuitions. Description of particular languages was to be supplanted by explanation in terms 
of language universals. Since they saw language performance as fatally compromised by random 
phenomena, they focused on language competence and the codification of native-speaker knowl- 
edge of language. Since they saw the structuralists’ zeal in describing and quantifying the surface 
details of language performance as a dead-end activity, they opted to reconstruct so-called deep 
structure. For the Chomskyans, structuralism’s reliance on a loose functionalism rendered it unsci- 
entific, while rigorous formalism led to genuine scientific progress. 

The Chomskyan critique of structuralism did properly skewer many of its excesses, but the 
Chomskyans went too far, reaching a point where almost nothing in the structuralist program was 
worth salvaging. Trask is accurate when he asserts that “the early generative linguists came to use 
‘structuralist’ as a term of abuse.”¹⁷ The onslaught was withering. “Chomsky had, effectively, put to
flight the corpus linguistics of an earlier generation. His view on the inadequacy of corpora, and the 
adequacy of intuition, became the orthodoxy of a succeeding generation of theoretical linguists.”¹⁸

It took a few decades, but corpus linguistics (or textlinguistics) has been rehabilitated by the 
computational linguists. The primacy of performance data over native-speaker intuitions has been 
reasserted. Description has again become defensible, even informative. Powerful statistical methods 
are flourishing.¹⁹ In short, many of the excesses of early scorched-earth generativism have been or
are being redressed. 


1.2.2.2 Phrase-Structure Grammars 

As data-driven approaches to linguistic study have made a comeback, the previously dismissed 
methods of phrase-structure grammar have been impressively generalized and shown to be ca- 
pable of covering a vast array of linguistic phenomena without ever resorting to transformations 
or positing deep structure. Over time, we have been increasingly drawn to this branch of syntactic 
theory. According to Matthews, a phrase-structure grammar is:²⁰


Any form of generative grammar consisting only of phrase structure rules. Hence any grammar which 
assigns to sentences a type of structure that can be represented by a single phrase structure tree. 


Our grammars are phrase-structure grammars, since our computational parsing algorithms involve
phrase-structure rules²¹ and since the resulting phrase markers are usually trees.²² (“Phrase markers”
and “trees” are introduced in §1.3.)


17. R. L. Trask, A Dictionary of Grammatical Terms (London: Routledge, 1993) 263. 

18. Geoffrey Leech, “The State of the Art in Corpus Linguistics,” in English Corpus Linguistics: Studies in Honour 
of Jan Svartvik (ed. K. Aijmer and B. Altenberg; London: Longman, 1991) 8.

19. Christopher D. Manning and Heinrich Schütze, Foundations of Statistical Natural Language Processing (Cam- 
bridge, MA: MIT Press, 1999). 

20. P. H. Matthews, Oxford Concise Dictionary (Oxford: Oxford University Press, 1997) 280. 

21. See appendix §A22 for information on our approach to the computer parsing of our text.

22. More precisely, as will be explained in chap. 20, our phrase markers are trees except when we encounter phe- 
nomena that force us to generalize from trees to graphs. 
1.2.3 The Autonomy of Syntax—Not

In this subsection, we consider the notion of the autonomy of syntax: “the doctrine that syntax
can and should be studied in isolation from other branches of linguistics and most particularly from
semantics.”²³ Our rejection of this doctrine is central to our representation of the grammatical functions
of Biblical Hebrew. As needed, we bring morphology, semantics, discourse analysis, and
world knowledge into our analyses of Biblical Hebrew. Thus, our procedures involve bottom-up
syntactic analysis as far as it will take us, but we also engage in top-down analysis when only that
enables what seems to us to be a proper overall analysis.²⁴
1.2.3.1 The Role of Morphology

Morphological analysis underlies our assignment of parts-of-speech labels to segments and of 
grammatical characteristics to these segments. We will not discuss morphological analysis in this 
book, since it is adequately covered in intermediate books on the grammar of Biblical Hebrew. 
1.2.3.2 The Role of Semantics 

In advancing a strict hierarchy of discovery procedures for inferring grammars, the structuralists 
argued that the various domains of linguistic inquiry were separate. In their view, syntax was au- 
tonomous from morphology and semantics, so “no syntactic analysis could be undertaken until the 
morphological analysis was complete.”²⁵ Appeals to semantic information were also out of bounds
as one did syntactic analysis. This view was maintained, at least by implication, by the generativists.
Over time, this strict view of separation has eroded. Of late, “linguists have often been willing
to accept that syntactic analysis must, at least sometimes, take note of semantic facts.”²⁶ We go
further, including explicit semantic information in our grammatical representations. 
1.2.3.3 The Role of Discourse Analysis

The common view among linguists is that the sentence is “the largest unit of grammar,”²⁷ a
view that we do not share. We see grammar as extending above and below²⁸ the sentence into discourse.
We take discourse analysis to be a serious discipline, one that we have begun to address.²⁹

Discourse analysts divide into two camps: linguists who see discourse analysis as an endeavor 
separate from the analysis of syntax (“the disjoint camp”) and linguists who see discourse analysis
as a continuation of clause-level analysis into the higher reaches of texts (“the unified camp”). We
identify with the latter camp, seeing grammatical relations as transitioning smoothly into discourse. 

We will encounter situations in which detailed discourse analysis is a prerequisite for doing 
adequate syntactic analysis. 




23. Trask, A Dictionary, 24. 

24. For a few added details on this topic, see appendix §A2.2.

25. R. L. Trask, A Dictionary, 263. 

26. Ibid., 24. 

27. Matthews, Oxford Concise Dictionary, 337. This is also the view of, among many others, Trask, A Dictionary, 
250; Crystal, A Dictionary, 414; also, M. G. Dareau, “Glossary,” in Concise Encyclopedia of Grammatical Categories 
(ed. Keith Brown and Jim Miller; Oxford: Elsevier, 1999) 436. 

28. We note that discourse units are quite commonly embedded in clauses. 

29. The next of our planned volumes will deal with the discourse analysis of Biblical Hebrew. For a provisional 
introduction to our approach, see chap. 21 below. 
1.2.4 Filling a Gap in Traditional Treatments

In this book, we attempt to fill a gap in the traditional treatments of Hebrew clause grammar. It 
is a commonplace in linguistics that the clause is the basic unit of grammar, discrete and integral. 
However, traditional Hebrew grammars have limited treatments of the internal syntax of whole 
clauses. The literature provides very few principled, systematic, exhaustive accounts of the gram- 
matical functions and relations operating among all of the constituents in the complete clauses of 
Biblical Hebrew in all their variety.³⁰

When it comes to the clause, traditional Hebrew grammars are partial and disjointed. They list 
the typical grammatical functions of the various parts of speech, supplying a few illustrations. Most 
of the text of the Hebrew Bible is never dealt with. So it is misleading to present a work as a gram- 
mar of Biblical Hebrew, when all it deals with are selected grammatical functions and constructions 
as found in only a fraction of the text. Corpus linguistics demands complete coverage. 

To make this point, we only need to examine two leading works. What is under scrutiny is the 
way of doing Hebrew grammar inherited from our fathers in the discipline. 

Waltke and O’Connor (IBHS). The first half of IBHS reviews the syntactic functions (not
always distinguished from “meanings”) of nouns, adjectives, numerals, and pronouns. There are 
four pages on “Aspects of the Syntax of Prepositions.” The second half takes up verb stems, whose 
functions are listed and illustrated. The treatment remains at the level of microsyntax—that is, it 
deals with small constructions. It does not advance to the study of the full array of constituents that 
each verb evokes in the clauses in which it is the predicator. 

At the end of the treatment of verb syntax, IBHS has two short chapters on the subordination
and coordination of clauses. These chapters view clauses as units in higher structures (sentences), 
and their functions are classified along traditional lines as conditional, causal, temporal, and so on. 
Here we see the beginnings of rudimentary bottom-up discourse grammar. Holistic top-down anal- 
ysis of clause-internal grammar is nowhere to be found. 

Joüon/Muraoka. Joüon/Muraoka is a general reference grammar in three parts: orthography
and phonetics, morphology, and syntax. Because the orthographic “word” is taken as the unit of 
grammar, some syntax is found in the treatment of morphology, as in the recognition of a pronoun 
suffix on a verb as its object. Anticipating the traditional treatment of the grammar of nouns under 
“cases,” a little additional syntax is found in the discussion of noun inflection, such as when a con- 
struct noun or suffixed stem is said to be “in the genitive” (p. 260). 

Part 3 of Joüon/Muraoka (“Syntax”) deals with the short-range syntactic functions (microsyntax)
of several parts of speech, beginning with the verb. Chapter 1 (“Tenses and Moods”) is
concerned with the “various semantic values” (p. 350) of the verb forms. The grammatical significance
of the semantics of verb roots is not taken into account. Otherwise, what is called “verb
syntax” is engaged with the problem of tense and aspect in the Hebrew verbal system. These are
important parts of Hebrew grammar, but they do not take us very far into syntax and do not even 
begin to deal with internal clause syntax as such. There is no interest in matters of transitivity or 
valence in the perspective of the relations of a verb to all the other clause immediate constituents 
in its clause. Chapter 7 (“Clauses”) distinguishes nominal and verbal clauses and then discusses 


30. Useful treatments, however, are to be found in the work of Wolfgang Richter and his students. See, for example,
W. Richter, Grundlagen einer althebräischen Grammatik, vol. 3: Der Satz (Arbeiten zu Text und Sprache im
Alten Testament 13; St. Ottilien: EOS, 1980).
a miscellaneous list of “particular clauses”: substantival (with noun-like functions), relative, circumstantial,
negative, interrogative, exclamatory, optative, asseverative, curse and oath, temporal,
conditional, final, consecutive, causal and explicative, concessive, adversative, exceptive, compara- 
tive, disjunctive. There does not seem to be any awareness that these categories occupy several 
taxonomic dimensions—logical, pragmatic, and text type. This, again, is the traditional approach, 
as can be seen by comparison with Davidson’s syntax of a century earlier. The enduring value of 
this tradition is appreciated. It takes each clause as a whole, as a discrete unit in discourse, but with 
little consideration of what is inside it. 

The bottom-up treatment of small constructions that never takes a whole clause into its purview 
and the rudimentary bottom-up discussion of the discourse functions of clauses as units—both of 
these approaches are limited by endemic binarism. A top-down holistic analysis of the internal syn- 
tax of complete clauses is required to fill the gap between these two approaches. 


1.3 Visualizing Structure 


1.3.1 Labeled Bracketing 


How the constituents of a phrase combine can be shown via a representation that is much used 
in linguistics, a labeled bracketing. We introduce “labeled bracketing” here because our preferred 
representation, the phrase marker, evolves nicely from it. 

Construction of the labeled bracketing is straightforward. Each sequence of constituents making 
up a phrase is flanked by paired brackets, and then a label specifying the kind of phrase is placed 
immediately within the leading bracket. To promote readability and minimize clutter, our illustra- 
tions of bracketing will be in English, with the segments in italics. 

Let us construct the labeled bracketing for the three-segment word מפריו, from + fruit + its.
Consider first the common noun fruit. The labeled bracketing for fruit is [N fruit], where N indi- 
cates that this segment is a noun. Relying on our recollections of grade-school grammar, we assert 
that fruit and its may next be combined to form the larger noun phrase whose structure is [NP [N 
fruit] [pron its]], where “pron” indicates that its is a pronoun. The syntactic structure of the full 
prepositional phrase—abbreviated PP—is then represented as in (1.1). 


(1.1) [PP [prep from] [NP [N fruit] [pron its]]]


This bracketing captures the syntactic structure of the prepositional phrase, but correctly pairing the 
left brackets with their corresponding right brackets can be tedious, especially when the constitu- 
ents become lengthy. We will overcome this limitation in the next section. 
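
The construction steps just described can be sketched in a few lines of Python. This is only an illustration: the tuple encoding (a label followed by its children) and the function name `bracket` are assumptions made for this example and are not part of the analysis system described in this book.

```python
# Illustrative sketch: a constituent is a tuple (label, child, child, ...),
# where each child is either another constituent or a plain word (str).
# The encoding and the name `bracket` are assumptions for this example.

def bracket(node):
    """Return the labeled bracketing of a constituent."""
    if isinstance(node, str):          # a bare word (segment)
        return node
    label, *children = node
    inner = " ".join(bracket(child) for child in children)
    return f"[{label} {inner}]"

# Build the phrase step by step, in the same order as the prose.
fruit = ("N", "fruit")
its = ("pron", "its")
noun_phrase = ("NP", fruit, its)
prep_phrase = ("PP", ("prep", "from"), noun_phrase)

print(bracket(fruit))        # [N fruit]
print(bracket(noun_phrase))  # [NP [N fruit] [pron its]]
print(bracket(prep_phrase))  # [PP [prep from] [NP [N fruit] [pron its]]]
```

Because the brackets are generated from the nested structure, they are always correctly paired, which is the step that becomes tedious when done by hand.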

The decoding of labeled brackets only becomes more arduous as more complex constituents are 
analyzed. For example, consider this six-word clause from 2 Kgs 18:8: 


הוא הכה את פלשתים עד עזה
We translate the six words literally as:
He + he-hit + [nota acc.] + Philistines + until + Gazah.


The labels needed for representing the structure of this clause are: 
Label    Constituent Name
S        sentence
VP       verb phrase
PP       prepositional phrase
PN       proper noun
V        verb
prep     preposition
pron     pronoun³¹


In terms of these labels, the labeled bracketing for this simple clause is as in (1.2). 
(1.2) [S [pron he][VP [V he-hit][PP [prep [nota acc.]][PN Philistines]][PP [prep until][PN Gazah]]]]


This is not easy to decipher. Its structure can be made clearer by introducing judicious indenting as 
is shown in (1.3), that is, by making an indented list. 


(1.3) [S
        [pron he]
        [VP
          [V he-hit]
          [PP
            [prep [nota acc.] ]
            [PN Philistines]
          ]
          [PP
            [prep until]
            [PN Gazah]
          ]
        ]
      ]


In this representation, we keep the words and their part-of-speech labels together with their enclos- 
ing brackets on the same vertical line. The constituents in the column immediately to the right of 
a given constituent down to its closing bracket are called its immediate constituents (ICs). Thus, 
for example, the immediate constituents of S (the sentence) are a pron (pronoun) and a VP (verb 
phrase). The VP has three ICs: V (the sentence's verb) plus two PPs (prepositional phrases). The 
first PP has two ICs: the nota accusativi (object marker) and a proper noun. Even this representa- 
tion is awkward to read. Enter the phrase marker. . . . 
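
The indented list and the notion of immediate constituents can be made concrete with a short sketch. The tuple encoding and the function names below are assumptions introduced for illustration only; they do not describe the software used to produce the analyses in this book.

```python
# Illustrative sketch; the tuple encoding (label, child, ...) and the
# function names are assumptions made for this example.

CLAUSE = (
    "S",
    ("pron", "he"),
    ("VP",
        ("V", "he-hit"),
        ("PP", ("prep", "[nota acc.]"), ("PN", "Philistines")),
        ("PP", ("prep", "until"), ("PN", "Gazah"))),
)

def is_word_level(node):
    """True for a node such as (V, he-hit) whose only child is a word."""
    return len(node) == 2 and isinstance(node[1], str)

def indented(node, depth=0):
    """Yield the lines of the indented list, one level per constituent."""
    pad = "  " * depth
    if is_word_level(node):
        yield f"{pad}[{node[0]} {node[1]}]"
        return
    yield f"{pad}[{node[0]}"
    for child in node[1:]:
        yield from indented(child, depth + 1)
    yield f"{pad}]"

def immediate_constituents(node):
    """Return the labels of the immediate constituents (ICs) of a phrase."""
    return [child[0] for child in node[1:] if not isinstance(child, str)]

print("\n".join(indented(CLAUSE)))
print(immediate_constituents(CLAUSE))     # ['pron', 'VP']
print(immediate_constituents(CLAUSE[2]))  # ['V', 'PP', 'PP']
```

The immediate constituents of a node are simply its direct children, which is why they line up in the column one step to the right of the node in the indented list.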


1.3.2 The Phrase Marker

Suppose that we link each constituent with each of its immediate constituents via an arrow from
the former to the latter and that we do away with the now superfluous brackets. Then the representation
in (1.4), mathematically equivalent to a labeled bracketing, is obtained. This is a phrase marker.


31. For now, so as to parallel standard works on syntax, we have chosen to name the most inclusive constituent the
sentence. However, as our exposition develops below, our major unit of syntactic structure will be termed the clause. 
(1.4)  S --+--> pron ---------------------> he
           |
           +--> VP --+--> V --------------> he-hit
                     |
                     +--> PP --+--> prep --> [nota acc.]
                     |         +--> PN ----> Philistines
                     |
                     +--> PP --+--> prep --> until
                               +--> PN ----> Gazah


Unlike a labeled bracketing, a phrase marker is easily understood. For example, in phrase marker 
(1.4), we readily see that the proper noun ‘Gazah’ combines with a preposition (‘until’) to form 
a prepositional phrase (PP = ‘until Gazah’), which is one of three immediate constituents of the 
sentence's verb phrase (VP = ‘he-hit [nota acc.] Philistines until Gazah’). 

Mathematically speaking, a phrase marker is a tree. If the phrase marker is counter-rotated by 
ninety degrees so that the S is at the bottom (the tree root) and the words are at the top (the tree 
leaves), then the reason for calling the structure a tree will be obvious. In linguistics textbooks, 
phrase markers are invariably displayed with the root at the top and the leaves at the bottom (i.e., 
the tree is upside down). We opt for placing the tree on its side with its root to the left so that long 
clauses can be displayed down the page or screen rather than across. 
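
The claim that a phrase marker is a tree can be checked mechanically. The sketch below is illustrative only: the tuple encoding, the path-based node identifiers, and the function name `arrows` are assumptions made for this example. It lists the arrows of the phrase marker for the clause from 2 Kgs 18:8 and confirms two tree properties: there is exactly one more node than there are arrows, and every node other than the root has exactly one parent.

```python
from collections import Counter

# Illustrative sketch only: the tuple encoding (label, child, ...) and the
# function names are assumptions made for this example.

CLAUSE = (
    "S",
    ("pron", "he"),
    ("VP",
        ("V", "he-hit"),
        ("PP", ("prep", "[nota acc.]"), ("PN", "Philistines")),
        ("PP", ("prep", "until"), ("PN", "Gazah"))),
)

def arrows(node, path="0"):
    """Yield (parent, child) arrows; each node is a (path, label) pair.

    The path (for example "0.1.2") keeps the two PP nodes distinct even
    though they carry the same label.
    """
    parent = (path, node[0])
    for index, child in enumerate(node[1:]):
        child_path = f"{path}.{index}"
        if isinstance(child, str):              # a word: a leaf of the tree
            yield parent, (child_path, child)
        else:
            yield parent, (child_path, child[0])
            yield from arrows(child, child_path)

edge_list = list(arrows(CLAUSE))
parents = {parent for parent, _ in edge_list}
nodes = parents | {child for _, child in edge_list}
leaves = [child[1] for _, child in edge_list if child not in parents]
parent_counts = Counter(child for _, child in edge_list)

print(len(nodes), len(edge_list))        # 16 15
print(set(parent_counts.values()))       # {1}
print(leaves)
```

Reading the leaves in order recovers the glossed words of the clause, which is what the right-hand column of (1.4) displays.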


1.3.3 The Enhanced Phrase Marker 


1.3.3.1 Enhancements Made 

Being fashioned in the traditional way, phrase markers of the sort presented above show how 
larger and larger constituents are built up until they finally combine to show the structure of entire 
clauses. 

As we developed our syntactic representation, we asked what additional information might be 
incorporated in a phrase marker to enhance its usefulness for research. One interesting and well-developed
syntactic theory, lexical functional grammar, argues that a complete presentation of syntax
must supply both a picture of the hierarchical makeup of clauses and a quite separate catalog
of essential information regarding grammatical functions. “[Functional]-structure is composed of
attributes (features and functions) and their values.”³² Included in functional-structure is information
specifying the subject, direct object, tense, and so on, associated with a clause.

Much functional-structure information can be straightforwardly grafted onto traditional phrase 
markers, as can information regarding the semantic characteristics of the constituents. To accom- 
plish this enrichment (and make our phrase markers more readable), we make these changes and 
enhancements to traditional phrase markers: 


1. Add the Hebrew text to the left of the English glosses.

2. Replace S (sentence) by cl (clause) in the root node at the far left of the phrase markers.

3. Delete the VP (verb phrase) constituent, joining its constituents directly to the root node.

4. Then, to the immediate right of the root node, insert a set of function-specifying labels,
short yet informative (e.g., dir obj) rather than acronymic (e.g., do), that provide the names


32. Y. N. Falk, Lexical-Functional Grammar: An Introduction to Parallel Constraint-Based Syntax (Stanford, CA: 
CSLI, 2001) 16.
of the clause immediate constituents (CICs).³³

5. Enrich the label information by introducing a two-line format. The upper line describes 
the form or function of the constituent. When form is being described, explicit semantic 
information is included. The lower line gives the licensing relation justifying the 
assembling of the constituent. 


The resulting structures might properly be termed enhanced phrase markers, but for brevity we will 
simply continue to call them phrase markers. 
Transforming the traditional phrase marker (1.4) yields the (enhanced) phrase marker (1.5).³⁴


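(1.5)  cl ----+-- sbj ---------------------------- הוא  he
       oblq   |   gram
              +-- vb ----------------------------- הכה  he-hit
              |   gram
              +-- dir obj --- obj mk+humn --+----- את  [nota acc.]
              |   gram        prep          +----- פלשתים  Philistines
              |
              +-- mvt aim --- until+geog ---+----- עד  until
                  gram        prep          +----- עזה  Gazah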


The innovations given in the previous list are realized in this phrase marker as follows: 

Item #1: Hebrew text. Self evident.

Item #2: “Clause” as Root Constituent. In a simple substitution, cl replaces S. Along with 
the newly minted cl in the upper line of the root node label, a new shortened form has appeared in 
the lower line: oblq. This is short for obliqueness, a topic taken up in §1.3.3.3.

Item #3: Deletion of the Verb Phrase. In most modern textbooks on syntax, one of the fixed 
points is that the root label (be it symbolized by S or cl) in the phrase marker for a declarative sen- 
tence always has two and only two immediate constituents, NP and VP. The former functions as 
the clause's subject, while the latter functions as the clause's predicate. This is the orthodox view. 
Practitioners of lexical functional grammar and others, however, suggest that holding this binary 
view of sentence structure is counterproductive when certain non-English-like languages, so-called 
nonconfigurational or fluid languages, are being investigated. We address this topic in §7.2 and
§7.3.3. We trust that until then readers will, however uneasily, accept our jettisoning of the use of
VPs for Biblical Hebrew. 

Items #4 and #5: CIC Insertion and Label Format. The inclusion of function-specifying labels
is a major innovation. This clause has the four clause immediate constituents (CICs) named
in the upper lines of the CIC labels. Their expanded names are as follows: 


33. Clause immediate constituents (CICs) are the major constituents in clauses. They are treated at great length in 
chaps. 9-16. Clause immediate constituents have also been referred to as clause-rank constituents in John H. Connolly, 
Constituent Order in Functional Grammar: Synchronic and Diachronic Perspectives (Berlin: Foris, 1991) 28. The eas- 
ily pronounced acronym CIC (“kick”) is our innovation. 

34. This and all of the subsequent phrase markers in this book have been exported from the Andersen-Forbes 
Phrase Marker Analysis of the Hebrew Bible, as displayed by the Libronix Digital Library System, Logos Research 
Systems. 
CIC Label    Expanded Name
sbj          subject
vb           finite verb, one kind of predicator
dir obj      direct object
mvt aim      movement aim


The subject is often said to be “what in the clause is already known” (its theme). Traditionally, it 
is the doer of the action in a clause. The predicator is not the traditional predicate.³⁵ Rather, it is
a verbal or quasiverbal constituent that specifies an equivalence, a state, an activity, or a process.
In the present example, the predicator is a finite verb. The direct object is that which is affected by
the action specified by the predicator. It is the “undergoer” of the action of the predicator. In Biblical
Hebrew, the direct object is often marked by the nota accusativi, as it is here. The movement
aim is the destination of the activity specified in the clause. The lower lines of the CIC labels each
read gram. This indicates that a CIC classification has been assigned on the basis of grammatical
knowledge. The significance of this label is discussed in §1.3.3.3.
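
The two-line label format lends itself to a simple data structure. The sketch below is a hypothetical encoding, not the format of the Andersen-Forbes database: the class name `Node`, its field names, and the helper functions are assumptions made for illustration. Only the label values (cl, oblq, sbj, vb, dir obj, mvt aim, gram, obj mk+humn, until+geog, prep) are taken from the text.

```python
from dataclasses import dataclass, field

# Hypothetical encoding of an enhanced phrase marker. Class, field, and
# function names are assumptions for this example; label values are from
# the discussion of phrase marker (1.5).

@dataclass
class Node:
    upper: str                                    # form or function
    lower: str                                    # licensing relation
    children: list = field(default_factory=list)  # Nodes or gloss strings

clause = Node("cl", "oblq", [
    Node("sbj", "gram", ["he"]),
    Node("vb", "gram", ["he-hit"]),
    Node("dir obj", "gram",
         [Node("obj mk+humn", "prep", ["[nota acc.]", "Philistines"])]),
    Node("mvt aim", "gram",
         [Node("until+geog", "prep", ["until", "Gazah"])]),
])

def glosses(node):
    """Collect the glossed segments under a node, in text order."""
    if isinstance(node, str):
        return [node]
    return [gloss for child in node.children for gloss in glosses(child)]

def cics(root):
    """Return (function, licensing relation, text) for each CIC of a clause."""
    return [(c.upper, c.lower, " ".join(glosses(c))) for c in root.children]

for upper, lower, text in cics(clause):
    print(f"{upper:8} {lower:5} {text}")
```

With the VP removed, the CICs are simply the children of the root node, so listing them requires no search through intermediate constituents.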


1.3.3.2 Construction-Identifier Label Formats 

The construction-specifying upper lines in the labels on non-CIC nodes come in two flavors: 
one for prepositional phrases and one for noun phrases. 

Prepositional Phrase Construction Format. Our example clause has two other labels to the 
right of the four CIC labels just explained. These non-CIC labels are in the prepositional phrase 
format. The upper line in the label on the first prepositional phrase constituent ‘[nota acc.] + Philistines’
reads obj mk+humn, meaning that the prepositional phrase consists of the object-marking
preposition plus (+) a noun having human semantics. The upper line in the label on the prepositional
phrase ‘until + Gazah’ reads until+geog, meaning that the prepositional phrase consists of
the preposition ‘until’ plus a noun having location/geographic semantics. In both labels, the lower
line of the label reads prep, indicating that we are dealing with a prepositional phrase gathered together
(“licensed”) because of the presence of a preposition.

Noun Phrase Construction Format. The noun phrase format is different. Consider phrase- 
marker fragment (1.6) from 2 Chr 29:32. 


(1.6)  n:anml --+-- n:anml --+-- בקר  cattle
       juxt     |   mod      +-- שבעים  seventy
                |
                +-- n:anml --+-- אילים  rams
                |   mod      +-- מאה  hundred
                |
                +-- n:anml --+-- כבשים  lambs
                    mod      +-- מאתים  two hundred


35. The term predicator has a long history. For example, in 1961, Halliday used it as we do: M. A. K. Halliday, 
Categories in the Theory of Grammar (Indianapolis, IN: Bobbs-Merrill, 1961) 257.
The constituent is a noun phrase with animal semantics (n:anml) produced by juxtaposing (juxt)
its three subconstituents. Each of the three subconstituents is itself a noun phrase with animal 
semantics (n:anml) produced by modifying (mod) the first segment in each phrase with the next 
segment. 

The upper line reads n:anml, indicating that the constituent formed is a noun (phrase) further 
specified (:) as involving animal semantics. 
1.3.3.3 The Licensing Relations of Grammar and Obliqueness 

When we say that the assembling of a prepositional phrase is licensed by the presence of an 
initial preposition or that the assembling of a construct phrase is licensed by the presence of an ini- 
tial substantive in the construct state, the underlying grammatical mechanisms being described are 
clear. But when we say that the assigning of some label to a clause immediate constituent (CIC) is 
licensed by grammar [gram] or that the assembling of a clause is licensed by obliqueness [oblq], 
the underlying grammatical mechanisms are unspecified. 

Given our adoption of a two-line format for the node labels in our phrase markers, we had to 
assign some licensing relations to justify the classification of CICs and the assembling of clauses; 
but which ones and, more importantly, with what meanings? We might have simply installed the 
label dummy, explicitly signaling our initial ignorance of the what, how, and why of CIC clas- 
sification and clause assemblage. Instead, we opted for grammar and obliqueness, allowing us to 
search for each licensing relation independently. The relations grammar and obliqueness stand for 
the very disparate and still obscure mechanisms that provide the grounds for classifying CICs and 
assembling them into clauses. Here is the critical point for our work: 


Gathering the data needed to begin working out the grammatical mechanisms underlying 
“grammar” and “obliqueness” is the focus of the latter two-thirds of this book. 


1.4 Brief Summary 


Text Source. We use the Kethiv text of the Leningrad Codex, “obvious errors” having been
corrected, cantillations having been omitted, lexical and structural ambiguity having been resolved
in favor of “salient readings,” and blocks of Aramaic text having been set aside.

Our Grammatical Stance. Where possible, we build on traditional grammars of Biblical He- 
brew. With regard to modern linguistics, we are eclectic. We ally ourselves with the corpus linguists 
but also find much of value in the work of the generativists, especially generalized phrase structure 
grammar. We reject the notion of the autonomy of syntax and bring in semantics and discourse 
analysis as needed. We iteratively rely on both bottom-up and top-down analyses. 

Visualizing Structure. Phrase markers, suitably extended, allow us to present our analyses 
graphically. Our phrase markers consist of labeled nodes and their connecting edges. The node 
labels disclose the functions and forms of constituents, as well as the grammatical principles li- 
censing their creation. Also included is information regarding the semantic classifications of the 
constituents. 

Of critical importance to our investigations are the clause immediate constituents (CICs), the 
immediate offspring of the clausal nodes (“roots”) in the phrase markers. Central foci of our book 
are the characteristics of two pivotal licensing relations: (1) gram, which guides the specification of 
the functions of the CICs, and (2) oblq, which constrains the assembling and ordering of the CICs 
in clauses. 


Chapter 2 


Text Division 


In chap. 1 we used, without comment, some examples wherein the basic units for syntactic 
analysis were parts of words, a word being conventionally defined as a continuous sequence of 
Hebrew consonants and vowels bounded by spacers: (white) space, line end, verse-end marker (sof 
pasuq), or dash (maqqep). In general, our basic unit of syntactic analysis is the segment. A seg- 
ment can be a word (“a free morpheme”), a part of a word (“a bound morpheme”), or a sequence 
of words. (For the place of morphology in our work, see appendix §A2.3.1.)

We dissect prepositions and pronoun suffixes from nouns. For example, above (§1.3.1) we
analyzed מפריו into three segments מ + פרי + ו ‘from’ + ‘fruit’ + ‘its’. We also detach definite
articles and conjunctions. For example, ביום is made up of three segments:


ב + _ + יום ‘in’ + ‘the’ + ‘day’
while mwami consists of these three segments: 
D+ war +1 ‘and’ + ‘they-seized’ + ‘them’ 


This example shows that we separate one-consonant segments (the conjunction 1) and object pro- 
noun suffixes from verbs but not the subject pronoun affixes of verbs. Segments are also formed by 
the ligaturing of words, almost invariably forming proper nouns. For example, in our analysis, we 
ligature the two-word sequence בית־אל to form a single segment glossed ‘Bethel’.

It should be clear that we do not accept the lexicalist hypothesis: “the view that rules of syntax
may not refer to elements smaller than a single word.”¹ As is often the case in Anglo-American
linguistics, this hypothesis is “English-o-centric.” Rejection of the lexicalist hypothesis is warranted
for Biblical Hebrew because its words commonly consist of sequences of morphemes, each
having a distinctive role in syntax. For example, were we to treat ולאמה in Gen 24:53 as a single
constituent, important syntactical structure would be hidden. We therefore divide this word into
four segments:


ו + ל + אם + ה ‘and’ + ‘to’ + ‘mother’ + ‘her’


We divide 1,258 four-segment words, 20,759 three-segment words, and 120,007 two-segment 
words into their segments. 
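
The counts just quoted imply some simple totals, worked out in the sketch below. The arithmetic uses only the three figures given above; the variable names are assumptions for this example, and the example words are represented by their English glosses rather than by the Hebrew text.

```python
from collections import Counter

# English glosses of the segmented example words quoted in this chapter.
EXAMPLES = [
    ["from", "fruit", "its"],
    ["in", "the", "day"],
    ["and", "they-seized", "them"],
    ["and", "to", "mother", "her"],
]
print(Counter(len(word) for word in EXAMPLES))

# Corpus-wide counts quoted in the text: segments per word -> number of words.
WORDS_BY_SEGMENT_COUNT = {4: 1_258, 3: 20_759, 2: 120_007}

words = sum(WORDS_BY_SEGMENT_COUNT.values())
segments = sum(n * count for n, count in WORDS_BY_SEGMENT_COUNT.items())
cuts = segments - words          # a word with n segments needs n - 1 cuts

print(words, segments, cuts)     # 142024 307323 165299
```

On these figures, 142,024 multisegment words yield 307,323 segments, which requires 165,299 individual cut decisions.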


1. Robert L. Trask, A Dictionary of Grammatical Terms in Linguistics (London: Routledge, 1993) 157. 
2.1 Words, Segments, and Ligatures


Having made the global decision to segment the words making up the biblical text, we were left 
with a large number of local decisions. We needed to choose which words should be segmented or 
joined and how. Making these choices required repeatedly answering three questions: 


1. Segmentation. Should a given word be segmented or left whole? 
2. Selection of cut-point. Where should a multisegment word be sliced?
3. Ligaturing. When should a sequence of words be ligatured?


2.1.1 Segmentation 

An example from English may help readers understand what the “segment/no segment” decision
involves. Consider the word “tomorrow.” This noun evolved from a prepositional phrase.² It
is instructive to examine this word in Exod 17:9 as printed in three English versions. The KJV uses
the term “to morrow,” the ASV has “to-morrow,” and the RSV has “tomorrow.” Were we to perform a
syntactic analysis of the verse in English, how would we represent “tomorrow”: as two segments
or as one? The answer depends on the word's lexical status when the version was published.

Our decisions regarding word segmentation are mostly intuitive, with a few exceptions.³ Our
segmentation of one sort of word deserves comment. We always segment לפני into ל + פני ‘to’ +
‘face-of’. We do so because we, and most writers of Biblical Hebrew grammars, view לפני as a
preposition “prefixed to the substantive פנים in the construct state.”⁴ The English translation
‘before’ has lexicalized the compound.⁵ Compare “in spite of,” “instead of,” and “in place of.” The
preposition לפני is not the only word of this kind in Biblical Hebrew.


2.1.2 Selection of Cut-Point 

Early in our work, we segmented the texts ad libitum, that is, “by the seats-of-our-pants.” This 
approach was indescribably tedious, and the results were unacceptably inconsistent. We therefore 
enunciated a set of sequentially applied rules that were at first manually implemented. This manual 
approach increased the tedium and, surprisingly, led to results that were not particularly consistent. 
Our next gambit was to have the computer enforce consistency. The computer was programmed to 
move an arrow along through the text, pausing at positions where a word cut might be made, and 
waiting until the human analyst accepted or rejected the proposed segmentation. If the segmenta- 
tion was accepted, then an arrow was left in position showing the cut, and the text file was adjusted 


2. E. C. Traugott, “Grammaticalization and Lexicalization,” in Concise Encyclopedia of Grammatical Categories 
(ed. K. Brown and J. Miller; Oxford: Elsevier, 1999) 182. 

3. For specifics, see F. I. Andersen and A. D. Forbes, A Linguistic Concordance of Ruth and Jonah (Wooster, OH:
Biblical Research Associates, 1976) 23-26. 

4. Bill T. Arnold and John H. Choi, A Guide to Biblical Hebrew Syntax (Cambridge: Cambridge University Press, 
2003) 115. Other works referring to the complex nature of לפני include GKC, 377; R. J. Williams, Williams’ Hebrew
Syntax (3rd ed.; Toronto: University of Toronto Press, 2007) 135-36; Bruce K. Waltke and M. O'Connor, An Introduction
to Biblical Hebrew Syntax (hereafter IBHS; Winona Lake, IN: Eisenbrauns, 1990) 221. Some treat לפני as unitary,
as in C. H. J. van der Merwe, Jackie A. Naudé, and Jan H. Kroeze, A Biblical Hebrew Reference Grammar (Biblical 
Languages: Hebrew 3; Sheffield: Sheffield Academic Press, 1999) 287. 

5. Lexicalization is “the process or result of assigning to a word or phrase the status of a LEXEME,” according to
R. R. K. Hartmann and Gregory James, Dictionary of Lexicography (London: Routledge, 1998) 84.

accordingly. Figure 2.1, a black-and-white photo of our computer monitor screen, was made in
early 1971.⁶ It shows the opening words of the book of Ruth with the segmenting arrows in place.


Figure 2.1. [Photograph of the monitor screen showing the opening words of Ruth with the segmenting arrows in place; image not reproduced.]


The pair of arrows at the beginning of the third line flanks the definite article represented by * ." 

Our final and most accurate approach to segmentation involved “bootstrapping.” The computer 
used the segmentation manifested in a stretch of correctly segmented text to segment new, previ- 
ously unseen text. There were arrow-insertion criteria and arrow-exclusion criteria. The new results 
were then manually corrected, and the newly checked results were used to segment the next block 
of text. And so on. This approach yielded quite high-quality results that were then perfected in the 
process of building a computerized, lemmatized dictionary for the Hebrew Bible. 
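
The bootstrapping cycle just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the program the authors ran: the function names (`propose_cuts`, `bootstrap`), the transliterated toy forms, and the `review` callback that stands in for manual correction are all hypothetical, and the actual arrow-insertion and arrow-exclusion criteria are not reproduced here.

```python
from typing import Callable, Dict, List

Segmentation = List[str]  # one word form split into its segments


def propose_cuts(word: str, table: Dict[str, Segmentation]) -> Segmentation:
    """Reuse the checked segmentation of a known form; leave unseen forms whole."""
    return list(table.get(word, [word]))


def bootstrap(blocks: List[List[str]],
              seed: Dict[str, Segmentation],
              review: Callable[[str, Segmentation], Segmentation],
              ) -> Dict[str, Segmentation]:
    """Segment block after block, folding each corrected result back in."""
    table = dict(seed)
    for block in blocks:
        for word in block:
            proposal = propose_cuts(word, table)
            table[word] = review(word, proposal)  # stands in for manual checking
    return table


# Toy data: the transliterated forms and their "correct" cuts are illustrative only.
GOLD = {"lipne": ["li", "pne"], "habbayit": ["ha", "bbayit"], "bayit": ["bayit"]}
checked = bootstrap([["lipne", "bayit"], ["habbayit", "bayit"]],
                    seed={"lipne": ["li", "pne"]},
                    review=lambda word, proposal: GOLD[word])
```

Each pass enlarges the table of checked forms, so later blocks need fewer manual corrections; that is the sense of "bootstrapping" assumed in this sketch.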


2.1.3 Ligaturing 

We deal with ligatured words every day. Consider “New York.” This proper noun has been lexicalized
for a long time and so, in our system, would be declared to be a single segment. In the case
of Biblical Hebrew, we have declared hundreds of proper nouns, plus four common nouns and one
subordinating conjunction, to be lexicalized. We have joined adjacent words to produce 382 distinct
segments. Two-thirds of these appear only once in the MT. The most frequently occurring ligatured
item is the subordinating conjunction כי אם ‘except’ (120x). The second most frequent is forms of
בית־אל ‘Bethel’ (72x), and the third is בית לחם ‘Bethlehem’ (41x). By our analysis, Biblical Hebrew
contains almost one thousand ligatured segments.
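
Ligaturing can be pictured as a single left-to-right pass that joins adjacent words found in a list of lexicalized pairs. The sketch below is a hedged illustration: the function name `ligature`, the underscore joiner, and the transliterated pairs are assumptions made for the example and are not taken from the authors' software.

```python
from typing import List, Set, Tuple


def ligature(words: List[str],
             lexicalized: Set[Tuple[str, str]],
             joiner: str = "_") -> List[str]:
    """Join each adjacent pair listed as lexicalized into one segment."""
    out: List[str] = []
    i = 0
    while i < len(words):
        if i + 1 < len(words) and (words[i], words[i + 1]) in lexicalized:
            out.append(words[i] + joiner + words[i + 1])
            i += 2  # both words are consumed by the ligature
        else:
            out.append(words[i])
            i += 1
    return out


# Hypothetical transliterated pairs standing in for 'Bethel' and 'except'.
PAIRS = {("bet", "el"), ("ki", "im")}
joined = ligature(["wayyaal", "bet", "el", "ki", "im", "bayit"], PAIRS)
```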


2.2 Chunking the Text into Clauses 


2.2.1 Rule-Based Clause Onset Detection 

For computational parsing, the largest appropriate units are clauses. In our approach, a clause 
typically consists of a predicator and the constituents that accompany it, a predicator being a verbal 
or quasiverbal constituent that specifies equivalence, activity, state, or process. A single finite verb 
can be a whole clause.⁷ We have explained elsewhere just how we determined clause boundaries.⁸
Here we briefly provide the flavor of that work. 


6. The pin-cushion distortion in the photo is due to the monitor screen's non-planarity. Four decades later, it is 
amusing to recall how inordinately pleased we were with our pathetic little unkerned Hebrew stick characters! 

7. Even a single constituent can form a “verbless” (or “nominal”) clause lacking a predicator. See chap. 19.

8. F. I. Andersen and A. D. Forbes, “On Marking Clause Boundaries,” in Bible et Informatique: Interprétation,
Herméneutique, Compétence Informatique (Paris: Honoré Champion, 1992) 181–202. On the general question, see Marjo
C. A. Korpel and Josef Oesch, eds., Unit Delimitation in Biblical Hebrew and Northwest Semitic Literature (Pericope
4; Assen: Van Gorcum, 2003).

We relied on a dozen ordered rules for detecting clause onset, clause-offset detection being a
more difficult problem. But, of course, a true main clause onset must coincide with a true offset,
setting aside the first clause in the Bible. Our rules are highly heuristic. To provide a sense of our
approach, we quote four of the rules from our referenced essay, tell how often they were used (“how
often they fired”), and tell their individual accuracies across Biblical Hebrew.


1. Rule A. “The quoting formula לאמר is usually followed immediately by a quoted
speech.” This rule fired 939 times, 929 of them correctly (a 99% true positive rate). There
were 10 places where a speech did not immediately follow the quoting formula.

2. Rule B. “A waw-sequential construction usually begins a new clause.” In 20,691 of its
20,907 firings a new clause begins (a 99% true positive rate).

3. Rule F. “When the first word in a verse is a predicator, it is likely that it begins a . . .
clause.” This rule fired 2,520 times, 17 incorrectly (a 97% true positive rate).

4. Rule L. “Each new chapter probably begins a new clause.” This rule fired 929 times, once
“incorrectly” (a 99.9% true positive rate). The verse prior to Jer 3:1 ends with a complete
and well-formed clause. Then Jer 3:1 begins with a stranded לאמר.


Our suite of rules correctly found almost two-thirds of the clause onsets with very few false onset
detections but with a fair number of fragmentary “clauses.” Completion of the task of clause
isolation required careful human over-reading.
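
To give a concrete feel for ordered, heuristic onset rules, here is a minimal Python sketch. It is not the authors' rule suite: the token tags (`QUOTE`, `WAW_SEQ`, `VERB`), the two toy predicates loosely modeled on Rules A and B, and the helper `true_positive_rate` are hypothetical. The rates computed at the end use only the firing counts quoted above for Rules A, B, and L.

```python
from typing import Callable, List, Sequence, Tuple

Token = Tuple[str, str]  # (form, tag); both are illustrative placeholders
Rule = Tuple[str, Callable[[Sequence[Token], int], bool]]


def after_quote_formula(tokens: Sequence[Token], i: int) -> bool:
    """Toy Rule A: the token right after a quoting formula opens a clause."""
    return i > 0 and tokens[i - 1][1] == "QUOTE"


def waw_sequential(tokens: Sequence[Token], i: int) -> bool:
    """Toy Rule B: a waw-sequential form opens a clause."""
    return tokens[i][1] == "WAW_SEQ"


RULES: List[Rule] = [("A", after_quote_formula), ("B", waw_sequential)]


def detect_onsets(tokens: Sequence[Token],
                  rules: List[Rule] = RULES) -> List[Tuple[int, str]]:
    """Apply the rules in order; the first rule that fires marks the onset."""
    onsets = []
    for i in range(len(tokens)):
        for name, fires in rules:
            if fires(tokens, i):
                onsets.append((i, name))
                break
    return onsets


def true_positive_rate(firings: int, correct: int) -> float:
    """Percentage of firings that marked a genuine clause onset."""
    return 100.0 * correct / firings


toy = [("wayyomer", "WAW_SEQ"), ("lemor", "QUOTE"),
       ("qum", "VERB"), ("wayyelek", "WAW_SEQ")]
onsets = detect_onsets(toy)
rates = {"A": true_positive_rate(939, 929),
         "B": true_positive_rate(20907, 20691),
         "L": true_positive_rate(929, 928)}
```

Because the rules are ordered, a position claimed by an earlier rule is never re-examined by a later one; a tally of firings per rule, as in `rates`, is what makes the individual accuracies reportable.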


2.2.2 Clause-Boundary Ambiguity 
Clause-onset position can be ambiguous. Consider three clauses from Exod 17:9: 


וצא הלחם בעמלק מחר אנכי נצב על־ראש הגבעה
The NJPS reads:
“. . . go out and do battle with Amalek. Tomorrow I will station myself on the top of the hill . . .”


But it might also be rendered:⁹


“. . . go out and do battle with Amalek tomorrow. I will station myself on the top of the hill . . .”


Whether a clause boundary should occur before or after "tomorrow" is formally ambiguous. In 
circumstances of this sort, we have the choice of either somehow representing both clause divisions 
or selecting and representing the more compelling division. We decided to provide mechanisms for 
handling the former option but, in this first pass at analysis, represented only the latter. This choice, 
however, leaves us to decide which of the several options is preferred. For the present example, 
we follow the cantillations, as do all seven English Bibles consulted, and divide the clauses before 
"tomorrow." We let unanimity rule, mindful that neither cantillations nor scholarly consensus is 
always correct. 
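
One way to keep both clause divisions available while committing to a preferred one is to store every candidate cut point and mark the chosen one. The following Python sketch is hypothetical: the class name `BoundaryChoice`, its fields, and the English stand-ins for the Hebrew tokens are assumptions for illustration and do not describe the authors' actual mechanism.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class BoundaryChoice:
    tokens: List[str]
    candidates: List[int]  # each value: index of the token opening the next clause
    preferred: int = 0     # index into candidates

    def split(self, which: Optional[int] = None) -> Tuple[List[str], List[str]]:
        """Divide the tokens at the preferred cut, or at another candidate."""
        cut = self.candidates[self.preferred if which is None else which]
        return self.tokens[:cut], self.tokens[cut:]


exod_17_9 = BoundaryChoice(
    tokens=["go out", "do battle", "with Amalek", "tomorrow",
            "I will station myself"],
    candidates=[3, 4],  # cut before or after "tomorrow"
    preferred=0,        # follow the cantillation: cut before "tomorrow"
)
first, second = exod_17_9.split()
```

Calling `exod_17_9.split(1)` would return the alternative division, so nothing is lost by representing only the preferred one in a first pass.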


9. Bear in mind that we have opted not to be constrained by the cantillations. The athnah in Exod 17:9 encodes 
a pause before “tomorrow,” indicating that the Masoretes divided the verbs into two clauses at this point. See further 
Emanuel Tov, Textual Criticism of the Hebrew Bible (2nd ed.; Minneapolis: Fortress, 2001) 68–69.


2.3 Brief Summary


Segments. Our basic units of analysis, segments, are whole words (“free morphemes”), parts
of words (“bound morphemes”), or ligatured words (“lexicalized phrases”). Delimitation of
segments relied on computational “bootstrapping” and consistency enforcement followed by correction
by an expert.

Clause Delimitation. Delimitation of clause boundaries involved computational application of 
a set of heuristic clause-onset rules followed by correction by an expert. 


Chapter 3 


Parts of Speech 


3.1 Approaches to Parts-of-Speech Specification


The grammatical category of a segment is its part of speech. The parts of speech (POSs) in a 
language form a system. Many are definable only when considered in relation to one another. A 
systems approach is essential, especially when multiple-part-of-speech homography is considered. 

The taxonomic problem requires that the analyst execute three tasks: (1) propose a part-of- 
speech system, (2) decide when segments need to be formed by word dissection or ligaturing, and 
(3) assign each resulting segment to its POS category. 

Two crucial points need to be appreciated: 


* The taxonomic tasks are addressed iteratively. One does not deal with task 1 fully, execute 
task 2 perfectly, and then carry out task 3. Rather, in dealing with tasks 2 and 3, one may 
discover aspects of the POS system that need adjustment. These adjustments, in turn, may 
require changes to segmentations and classifications that were made previously. 

* Carrying out the work involves sophistication ranging from matching up simple patterns to 
exegesis and reliance on world knowledge. The pretense of bottom-up analysis is just 
that—a pretense. In what follows, we will provide many examples illustrating this point. 


Parts of speech can be specified in five ways: ostensive (by presenting constrained lists), semantic 
(based on “meaning”), derivational (transforming a segment affix), paradigmatic (based on form 
patterns), and distributional (based on segmental environments).¹

3.1.1 Ostensive Specification

Ostensive (or, ostensible) specification (simple listing) is appropriate for some parts of speech. 
For example, we may specify the major² free pronouns (non-suffixing) by this list and its attendant
constraints: 

Any of these strings is a free pronoun 


I WI NDA DN  DnN NIDN ND NU nan (REIR 


if it is preceded by [a spacer or a form of ‘and’ or a ה-form of ‘the’]
and it is followed by a spacer. 
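
The ostensive rule above amounts to a list lookup constrained by what stands on either side of the string. A minimal Python sketch follows, assuming whitespace as the spacer and using hypothetical transliterations (`PRONOUNS`, `AND_FORMS`, `THE_FORMS`) in place of the Hebrew strings; it illustrates the shape of the rule and is not the authors' implementation.

```python
from typing import List, Tuple

# Hypothetical transliterated stand-ins; the real rule lists Hebrew strings.
PRONOUNS = {"ani", "attah", "hu", "hi"}
AND_FORMS = ("we", "wa", "u")
THE_FORMS = ("ha",)


def find_free_pronouns(text: str) -> List[Tuple[int, str]]:
    """Return (word index, pronoun) for every listed string that is bounded
    by spacers, optionally preceded by a form of 'and' or of 'the'."""
    hits = []
    for i, word in enumerate(text.split()):
        if word in PRONOUNS:
            hits.append((i, word))
            continue
        for prefix in AND_FORMS + THE_FORMS:
            rest = word[len(prefix):]
            if word.startswith(prefix) and rest in PRONOUNS:
                hits.append((i, rest))
                break
    return hits


found = find_free_pronouns("wehu amar ani hahu melek")
```

The sketch makes the dependency explicit: the pronoun rule is only as reliable as the lists of ‘and’ forms and article forms that it consults.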


1. Robert L. Trask, “Parts of Speech,” in Concise Encyclopedia of Grammatical Categories (ed. K. Brown and 
J. Miller; Oxford: Elsevier, 1999) 280—82. We have added “ostensive” to Trask’s list. 

2. Only one allomorph of each is shown here. In order to specify the free pronouns exhaustively, we would need to 
add 16 additional character strings that are here omitted from the table for the sake of simplicity. 




The foregoing illustrates the holistic nature of taxonomic assignment. The just-stated rule finds the
free pronouns, but it does so only if one can reliably find ‘and’ as well as the ה-form of ‘the’.
Identification of the coordinating conjunction ‘and’ is straightforward:
Any of these forms is an ‘and’ if it follows a spacer in Biblical Hebrew:


This rule correctly identifies coordinating conjunctions 99.94% of the time. It is incorrect 30 times 
and correct more than 50,000 times. If we seek perfection, then we should exclude from consider- 
ation all 30 occurrences of the following simple segments: 


m am cm Y wm cn nm» m wh wv 


The modified rule perfectly isolates all 50,000-plus ands in Biblical Hebrew. But the rule is not 
as simple as it might seem, for we have not discussed on what basis we chose the 10 segments to 
exclude. Three words in the string just above are common nouns (‘child’, ‘hooks’, and ‘hooks of’), 
and the other 7 are proper nouns. But we have not yet provided diagnostics for common and proper 
nouns. Nor have we discussed how we came to know the possible forms for and. Since traditional 
grammars are most reliable when they deal with functional categories, we accept their identifica- 
tions of the forms of and that are identical with the forms specified above. 
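
The ‘and’ rule with its exclusion list can be sketched as a prefix test plus a stop list. In the Python below, the transliterated forms in `AND_FORMS` and the three entries in `EXCLUDED` (stand-ins for ‘child’, ‘hooks’, and ‘hooks of’) are assumptions for illustration; the seven excluded proper nouns are not reproduced. The accuracy figure uses the counts quoted above, taking 50,000 as a lower bound on the correct firings.

```python
AND_FORMS = ("we", "wa", "u")            # hypothetical transliterations
EXCLUDED = {"walad", "wawim", "wawe"}    # hypothetical stand-ins for excluded nouns


def starts_with_and(word: str) -> bool:
    """True if the word opens with a form of 'and' and is not on the stop list."""
    return word not in EXCLUDED and word.startswith(AND_FORMS)


def accuracy(correct: int, incorrect: int) -> float:
    """Percentage of rule firings that were correct."""
    return 100.0 * correct / (correct + incorrect)


unmodified_rule = accuracy(50_000, 30)   # before the exclusions are added
```

With the stop list in place the 30 misfires disappear, which is why the modified rule is described as perfect; the cost is that the stop list itself presupposes a prior classification of nouns.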

The specification of ה-forms of ‘the’ is, unfortunately, quite complex. We accept the traditional
view that the ה-forms of the definite article are הַ־, הָ־, and הֶ־. Had we assumed that any word-initial
occurrence of these characters should be identified as a definite article, then our false-positive
error rate would have been 11%, which is unacceptably high. Having carried out the classification,
we know that הַ־, הָ־, and הֶ־ are also word-initial in instances of 16 other parts of speech. We need
to constrain the rule for the definite article to avoid misfiring in these cases. This assumes that we
have worked out criteria for recognizing the alternate categories. Once again, we see the
interactions encountered when one attempts to specify the parts of speech and identify their members.


3.1.2 Semantic Specification 

Basing part-of-speech classification on meaning is unreliable. According to Trask: “Though
popular in the past, this criterion is rejected today, since it is hopelessly misleading: lexical
categories are syntactic categories, not semantic ones, and the meaning of a word is at best no more than
a rough guide to its likely word class.”³ For the most part, we steer clear of meaning as an approach
to classification.

3.1.3 Derivational Specification

Derivational specification is quite rare in Biblical Hebrew. We recognize only two kinds of
segment obtained by adding derivational suffixes to other segments.

3.1.3.1 He Locale


Adding a ־ה onto a noun transforms it. When a common noun is involved, the result is a
directional adverb. Example: ארצה ‘land-ward’. When a proper noun is involved, the result is a movement
aim or target. Example: מצרימה ‘Egypt-ward’. In Biblical Hebrew, we count the former 427 times
and the latter 406 times. Of course, ־ה is a very common word ending (occurring almost 20,000
times), so we cannot provide a simple rule to identify this phenomenon.


3. Trask, “Parts of Speech,” 280.


3.1.3.2 Suffixed ־ם

The suffix ־ם is added to a common noun to derive an adverb 114 times. This suffix contributes
only these 6 segments to our dictionary:


אָמְנָם truly    אֻמְנָם truly    דּוּמָם silently    חִנָּם freely    יוֹמָם by day    רֵיקָם in vain


Because ־ם is a common word ending, we attempt no rule(s) to identify this phenomenon.


3.1.4 Paradigmatic Specification 

Paradigmatic specification relies on form patterns exhibited by various parts of speech. It is the
most popular way of specifying parts of speech among the writers of grammars.⁴

Some patterns are precisely diagnostic, while others reduce the possibilities to a manageable
few. For example, suppose that we specify the Qal suffixed 1st-pers. sing. verb as follows:


A segment consisting of a consonant with a qames
a consonant with a patah
a consonant with a šewa
followed by ־תי
then followed by a spacer
is a Qal suffixed 1st-pers. sing. verb.


This rule correctly specifies verbs having the shape אמרתי ‘I said’ in 456 of the 1,390 instances of
Qal suffixed 1st-pers. sing. verbs (33%). To specify the remaining two-thirds, additional rules must
be written. For example, if we specify words of the shape ראיתי ‘I saw’, then we classify an
additional 280 such verbs (another 20%). And so on through the many paradigms.
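
The paradigmatic rule above translates naturally into a pattern over pointed Hebrew. The Python sketch below is an illustration under assumptions: it takes the ending to be the suffix taw plus hiriq plus yod, treats the segment as already isolated (so the spacer condition becomes a full-string match), and strips dagesh, accents, and shin/sin dots before matching. The names `NON_VOWEL`, `PATTERN`, and `is_qal_suffixed_1sg` are hypothetical.

```python
import re
import unicodedata

QAMES, PATAH, SHEWA, HIRIQ = "\u05B8", "\u05B7", "\u05B0", "\u05B4"
CONSONANT = "[\u05D0-\u05EA]"
# Marks ignored by this rule: accents, dagesh, meteg, rafe, shin and sin dots.
NON_VOWEL = re.compile("[\u0591-\u05AF\u05BC\u05BD\u05BF\u05C1\u05C2]")

# consonant+qames, consonant+patah, consonant+shewa, then taw+hiriq+yod
PATTERN = re.compile(
    f"{CONSONANT}{QAMES}{CONSONANT}{PATAH}{CONSONANT}{SHEWA}\u05EA{HIRIQ}\u05D9"
)


def is_qal_suffixed_1sg(segment: str) -> bool:
    """Match one isolated, pointed segment against the stated shape."""
    bare = NON_VOWEL.sub("", unicodedata.normalize("NFD", segment))
    return PATTERN.fullmatch(bare) is not None


I_SAID = "\u05D0\u05B8\u05DE\u05B7\u05E8\u05B0\u05EA\u05BC\u05B4\u05D9"  # 'I said'
I_SAW = "\u05E8\u05B8\u05D0\u05B4\u05D9\u05EA\u05B4\u05D9"               # 'I saw'
```

As in the text, the ‘I saw’ shape fails this pattern and would need its own rule; the quoted coverage figures correspond to 456/1,390 and 280/1,390 of the instances.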


3.1.5 Distributional Specification 

Distributional specification relies on segment environments to infer the relations among parts
of speech. The traditional approach to distributional analysis relies on frames and on the judgments
of native speakers. A native speaker is provided a frame and asked what items can fill its gap.
Taking an example from Crystal,⁵ one might be given the frame “She saw ___ box” and asked what
words can fill its gap. Crystal asserts that the words that can fill this gap are determiners: {the, a,
my, one, etc.}. We observe, however, that if box is taken to be a verb, then a quite different set of
words can fill the gap: {Joe, boys, men, kangaroos, etc.}. In general, there are better ways to get
at distributional patterns than via frames, ways that do not require native-speaker discriminations.
In practice, these methods can only be reliably investigated computationally. Forbes has made two


4. See the tables of paradigms scattered throughout GKC; through van der Merwe, Naudé, and Kroeze, A Biblical 
Hebrew Reference Grammar (Sheffield: Sheffield Academic Press, 1999); and at the end of Paul Joüon and T. Muraoka, 
A Grammar of Biblical Hebrew (rev. Eng. ed.; Rome: Pontifical Biblical Institute, 2006) 617—49. 

5. David Crystal, A Dictionary of Linguistics and Phonetics (5th ed.; Oxford: Blackwell, 2003) 188.

Figure 3.1. Major Grammatical Categories of Biblical Hebrew


forays into this area.⁶ If progress in this area continues, it should be possible to combine
paradigmatic and distributional specification in ways that will lead to new insights into the
interrelationships of the parts of speech of Biblical Hebrew.
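
A corpus-based alternative to asking native speakers is to record, for every word, the frames in which it actually occurs. The Python sketch below is a toy illustration only: the three English sentences, the boundary markers `<s>` and `</s>`, and the function names `frames` and `fillers` are assumptions, and the sketch is far simpler than the distributional methods cited above.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple


def frames(sentences: List[List[str]]) -> Dict[str, Set[Tuple[str, str]]]:
    """Map each word to the set of (left neighbour, right neighbour) frames it fills."""
    table: Dict[str, Set[Tuple[str, str]]] = defaultdict(set)
    for sent in sentences:
        padded = ["<s>"] + sent + ["</s>"]
        for left, word, right in zip(padded, padded[1:], padded[2:]):
            table[word].add((left, right))
    return table


def fillers(table: Dict[str, Set[Tuple[str, str]]],
            left: str, right: str) -> List[str]:
    """All words attested in the gap of the frame left ___ right."""
    return sorted(w for w, seen in table.items() if (left, right) in seen)


corpus = [["she", "saw", "the", "box"],
          ["she", "saw", "a", "box"],
          ["she", "saw", "Joe", "box"]]
gap = fillers(frames(corpus), "saw", "box")
```

The toy corpus reproduces the observation made above: the frame alone does not separate determiners from nouns, because "box" may be a noun or a verb.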


3.2 The Andersen-Forbes Part-of-Speech System 


The darkened boxes in figure 3.1 disclose the top level of our taxonomy of Biblical Hebrew.
This seven-category taxonomy (miscellany, prepositions, conjunctions, adverbials, substantives,
substantives-verbals, and verbals) is far too coarse-grained for our purposes, so we begin
elaborating it by subdividing the adverbials into five subcategories (modals, adverbs, negatives, closed
interrogatives, and some open interrogatives), the substantives into four subcategories (some open
interrogatives, pronouns, proper nouns, and other nouns), and the verbals into two subcategories
(quasiverbals and verbs). There is little in this taxonomy that would surprise the writers of
grammars, as examination of GKC, BDB, IBHS, Joüon and Muraoka, and van der Merwe et al. has
confirmed. Three innovations deserve comment:


6. A. Dean Forbes, “Squishes, Clines, and Fuzzy Signs: Mixed and Gradient Categories in the Biblical Hebrew 
Lexicon," in Syriac Lexicography I: Foundations for Syriac Lexicography (ed. A. D. Forbes and D. G. K. Taylor; Pisca- 
taway, NJ: Gorgias, 2006) 105-39; idem, “Distributionally Inferred Word and Form Classes in the Hebrew Lexicon," in 
Syriac Lexicography II (ed. Peter Williams; Piscataway, NJ: Gorgias, 2009) 1-34. 




Table 3.1. 37 Grammatical Categories for Biblical Hebrew 


exclamatives        quasiverbals                  cohortative sequentials
definite articles   pure verb ptcps.              imperative verbs
[nominalizers] אשר  infinitives absolute          insistent imperatives
prepositions        suffixed (perfect) verbs      jussive verbs
conjunctions        prefixed (imperfect) verbs    cohortative verbs


modals      proper nouns   infinitives construct    prefixed (preterite) verbs
adverbs     ‘all’ כל       noun-verb/noun ptcps.    suffixed sequentials
negatives   numerals       noun-verb ptcps.         prefixed sequentials


1. We distinguish closed interrogatives (interrogatives allowing only yes-or-no answers)
from open interrogatives (interrogatives allowing answers that are drawn from open sets of
possible answers).⁷

2. We recognize a mixed category of verbal nouns: part substantive, part verb.

3. We have also introduced a subclass of verbals that we refer to as quasiverbals.⁸


The expanded set of grammatical categories tabulated above is still too coarse for our work, so 
we further subdivide categories, obtaining the 37 more fine-grained categories listed in table 3.1. 
Most of these categories will already be familiar to readers. Four comments may be helpful: 


1. We have subdivided the nouns into eight subcategories and the finite verbs into ten 
subcategories. 

2. Because of its odd behavior and to facilitate further study, we have made כל ‘all’ a
one-lexeme category.⁹

3. We have subdivided the participles (ptcps.) into four subcategories. 

4. Readers wanting a concise definition of a category should consult the glossary. 


We go one step further, dividing various categories into even finer categories. This leads to
the 76-category part-of-speech taxonomy shown in the three-panel display in fig. 3.2.
The right-most entries in each panel disclose the most fine-grained categories,
for example, [lapsūs calami], modals, demonstratives. With two exceptions,¹⁰ entries that include a
Hebrew segment are subcategories of traditional “parental” parts of speech (for example, the many
individual prepositions).

Readers may wonder why we have more grammatical categories than is traditional. In the case 
of the left-most panel in fig. 3.2, why not simply have miscellany, preposition, and conjunction and 
omit the finer-grained categories? Our splitting of the segments into many categories addresses a 


7. Some of the open interrogatives are classified as adverbials and others as substantives, as we discuss in §9.3.4.3.
8. Van der Merwe et al. (A Biblical Hebrew Reference Grammar, 58–59) refer to a subset of what we call
quasiverbals as “predicators of (non)existence.” Waltke and O’Connor (IBHS, 72) refer to these predicators as “quasi-verbal
indicators.” See chap. 18 for a full treatment of quasiverbals.
9. See Forbes, “Squishes,” 121–22.
10. The two exceptions are [nominalizers] אשר and כל ‘all’.

question bedeviling taxonomists: should one lump segments into a few shared categories or assign
them to many disparate categories? Since it is trivial to merge sets of segments but nontrivial to 
split them, we have opted to do the splitting work. This way, individuals who want to examine fine 
details may do so, while those who prefer less fine-grained POSs can merge our categories. 
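
The asymmetry between merging and splitting is easy to see in code: merging fine-grained tags is a table lookup, whereas splitting a coarse tag would require re-examining every token. The sketch below is illustrative only; the tag labels and the mapping `FINE_TO_COARSE` are hypothetical and are not the labels used in our data.

```python
from collections import Counter
from typing import Dict, List

# Hypothetical fine-grained labels mapped to traditional coarse ones.
FINE_TO_COARSE: Dict[str, str] = {
    "prep:in": "preposition",
    "prep:to": "preposition",
    "conj:and": "conjunction",
    "conj:because": "conjunction",
}


def merge(tags: List[str], mapping: Dict[str, str]) -> List[str]:
    """Replace each fine tag by its coarse parent; unmapped tags pass through."""
    return [mapping.get(tag, tag) for tag in tags]


fine = ["prep:in", "conj:and", "prep:to", "verb", "conj:because"]
coarse = merge(fine, FINE_TO_COARSE)
counts = Counter(coarse)
```

No comparable one-line function recovers `fine` from `coarse`, which is the practical argument for doing the splitting work once and publishing the fine-grained result.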

Being “splitters” results in a dilemma. As William Croft put it: “Splitting: where does one
stop?”¹¹ We answered Croft’s question on the bases of traditional insights and our practical needs.
(1) In recent times, particles were split from verbs and nouns on morphological grounds; they are
not inflected. The high ratio of particle tokens (segments) in the text to their types (lexemes in
the dictionary) indicates that “particles” do a great deal of grammatical work. The most important
fact is that particles, as a default category, have diverse grammatical functions and thus belong
to a corresponding range of parts of speech. Additionally, each conjunction has characteristic
logical functions, so each conjunction is a POS in its own right. (2) Verbs share similar
grammatical functions by virtue of being predicators. Many
nouns have similar grammatical functions on the basis of shared semantics (e.g., animates may
be the subjects of clauses). (3) Prepositions have a common grammatical function of “governing”
nominals, but each preposition makes a distinctive contribution to the semantic roles of
prepositional phrases. Hence each preposition gets its own POS. For practical reasons, we bundled the
rarely used prepositions into a default POS called “other prepositions.”


3.2.1 Figure 3.2, Left-most Panel: Particles


The category particles contributes 108 lexemes to our dictionary, 1.2% of the total. These few
entries account for 36% of Biblical Hebrew segments. Many parts of speech making up particles
involve homography.


3.2.1.1 Miscellany 

The miscellany set contains four parts of speech that do not fit well elsewhere. 

Lapsūs calami. We have 32 ‘calamitous lapses’. Lapsus calami is not a genuine part of speech,
since it is assigned to segments whose part of speech cannot be determined.

The lapsūs calami are of two kinds: 23 Qere welo’ Kethiv plus nine “true” lapsūs calami.

The Qere welo’ Kethiv occur in Judg 20:13; 2 Sam 8:3, 16:23, 18:20; 2 Kgs 4:7, 19:31, 19:37;
Isa 55:13; Jer 31:38, 50:29; Ezek 9:11; Ps 30:4; Job 2:7; Prov 23:24, 27:24; Ruth 3:5, 3:17; Lam
2:2, 4:16, 5:3, 5:5, 5:7 (2x).

The “true” lapsūs calami are hapaxes and are distributed as follows:


Gen 30:11 732 1 Kgs 12:33 T3? Job 26:12 nun 
Deut 33:2 DTN Ps 10:10 mN3?n Lam 4:3 ouy ^3 
1 Sam 4:13 T Ps 55:16 nim» 2 Chr 34:6 omna ana 


Exclamatives. Our set of exclamatives contains 20 lexemes and accounts for 225 segments in 
our Biblical Hebrew text. The list shows our dictionary headings along with their glosses. 


11. William Croft, “Parts of speech as language universals and as language-particular categories,” in Approaches 
to the Typology of Word Classes (ed. P. M. Vogel and B. Comrie; Berlin: de Gruyter, 2000) 76.

jax alas "w oh xix O! mn hah
ax O! Tx how! 1 O0! in ho 
Jax abrek 5x alas Nn hey "in hoy 
ang ahah aX amen nmn hey nin behold! 
nx sigh MAX certainly ta oh ng how! 


Note that specification of some of the lexemes involves resolution of homography. For example, אבי
appears as a word, אבי ‘father-my’ [a two-segment word], and as three different lexemes: אבי
‘father’ [suffixed masc. sing. common noun], אבי ‘I will bring’ [Hiphil active 1st-pers. sing. prefixed
verb, variant form], and אבי ‘O!’ [exclamative]. Twice, we have classified אבי as an exclamative, in
1 Sam 24:12 and in Job 34:36.

Definite articles. Three ה-form definite articles were specified in §3.1.1 above. The definite
article POS also includes three segments that lack the ה: - -, - -, and - -

Nominalizers. The principal segment that nominalizes and thereby embeds an immediately
following clause, converting the clause into a noun or noun equivalent, is אשר ‘which’.¹² Often
referred to as a relative pronoun, אשר nominalizes 5,451 times. The form also exhibits homography
and functions as a subordinating conjunction 51 times. Contrast these clauses:


ORS WW NPAT NR a" 
He set the rods that he had peeled . . . (NRSV, Gen 30:38)


WR] NNW ADIT DY ON 103 
God has given me my hire because I gave my maid to my husband. (NRSV, Gen 30:18)


In the first clause, אשר nominalizes the following clause, which consists of only a finite verb. In
the second (partial) clause, אשר is a subordinating conjunction. The phrase marker (3.1) shows the
three clause immediate constituents (CICs) for the clause in Lam 2:17. Note how אשר nominalizes
the one-word clause consisting of a finite verb, glossed ‘which he planned’.


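(3.1) [Phrase marker for the clause in Lam 2:17, not reproduced. Its three clause immediate constituents are: the verb ‘he made’; the subject (sbj) ‘Yahweh’; and the direct object (dir obj), a nominalized clause made up of ‘which’ + ‘he planned’.]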


3.2.1.2 Prepositions 

Our set of prepositions is smaller than most other grammarians’ sets, but none of ours should
surprise readers. As usual, there is much homography. For example, various forms of את ‘[obj]’ and
את ‘with’ have been resolved. Regarding segmentation, we left compounds unsegmented iff their
nominal components were never attested with nominal functions and their original literal meaning.


12. Our other nominalizers are: nt, 8x; it, 1x; 1, 15x; -w, 135x. We do not recognize -7 as a nominalizer.

Hence בגלל ‘on account of’ remains intact, since גלל is attested only in this compound (BDB, 164a).
But פני often means a literal ‘face’, and so we form ל + פני (these practices alter preposition counts).
For the record, the category called “other prepositions” has these 13 rarely attested members:


4nN after pa between mpi for sake of i5 because 
ON near "ma without wa through 72) except 
554 on account of ^ ^p?a except "nar except wn for 


Yap before 


3.2.1.3 Conjunctions 

Our set of conjunctions involves greater delicacy than is typical. Their behavior is complex. 
They can operate within phrases, within sentences (joining main clauses), and within discourses. 
Further, as has been shown elsewhere,¹³ forms such as גם exhibit very complex distributional
patterns.

Once again, there is ample homography to be described. Three situations are prominent: 


1. The various forms of the coordinating conjunction ו ‘and’ are assigned in a
context-specific way to two subcategories: standard coordination and ו-sequential coordination.

2. The conditional and interrogative functions of אם, ‘if’ and ‘[question]’, are resolved.

3. The many discourse-level functions of כי exemplified by the glosses ‘because’, ‘but’, ‘that’,
‘although’, ‘when’, and ‘if’ are resolved.


The category labeled “other conjunctions” is a mélange of relatively rarely attested conjunctions: 


Dax but vwa for sake of nga except IND since 

bx if in if ym therefore y until 
qx also "n except *»»5 perhaps Sy because 
wy because m before 122 therefore apy because 

owa before i» because wn for the sake of ip lest 
V because 


3.2.2 The Middle Panel: Adverbials and Pronouns 


The central panel of fig. 3.2 holds 12 ostensively defined categories plus the modals, adverbs, 
demonstratives, free pronouns, and bound pronouns. 


13. C. H. J. van der Merwe, The Old Hebrew Particle gam (St. Ottilien: EOS, 1990). Also, Forbes, “Squishes,” 
135. 

14. Some grammars assign multiple quasi-POSs to ‘and’ (BDB has five), and the NIV gives it many English glosses.
See E. W. Goodrick and J. R. Kohlenberger, Zondervan NIV Exhaustive Concordance (2nd ed.; Grand Rapids, MI:
Zondervan, 1999) 1396-98. By contrast, at present we distinguish only two kinds of ‘and’.

3.2.2.1 The Modals
According to our glossary, a modal is a “segment that expresses the speaker’s uncertainty or
desire with regard to some statement.” We recognize seven modal lexemes:¹⁵


Jax surely nx would that 42x surely 35 would that 
ix perhaps TR surely %3 surely 


Note that the segments נָא/נָּא ‘[emphatic]’ are not included among our modals. From the
perspective of pragmatics, they do exhibit modal behavior. But they, unlike the modals listed above, are
invariably enclitic. Hence, we have assigned them with reservations to the adverbs category.¹⁶

3.2.2.2 The Adverbs

We recognize these 35 adverbs: 


ts then wn quickly T3» apart Tip still 
"inm back mo not yet TRA very nny now 
nx afterward wm together my quickly na here 
minx backwards 733 already nem below Dna suddenly 
TN only n3 thus" byn above pa only 
xia then n33 thus myn upwards ow there 
ANIA farther " very Nl [emphatic] maw thither 
D» here i» thus rm yet nnm underneath 
min here Nb certainly Tip again" 


3.2.2.3 The Negatives 
We recognize 5 negation lexemes: 


"w not ow do not ON not 'm?3 not Nb not 


3.2.2.4 The Interrogatives 


There are these 2 closed interrogatives: אם [question] and ה yes?/no?


We also distinguish these 11 open interrogatives: 


15. In addition, we recognize the idiomatic modal expression מי יתן, translated ‘would that’, as a modal. See
§15.3.4.4.

16. See GKC, 308 n. 1 and especially n. 2. 

17. Van der Merwe (private communication) holds that כה is always “thus pointing forward” while כן is “thus
prototypically pointing backward.”

18. We recognize both a durative sense (‘still’), concomitant with imperfective aspect, and a repetitive sense
(‘again’), concomitant with perfective aspect. The word עוד has other senses, less prototypical, that we have not yet
distinguished.

mx where? myx where? nw where? vma why? mn what? pn when?
TN how? 5x where? psn whence? mn what? who? 


Note that the open interrogatives are not included with the pronouns in fig. 3.2 but, rather, are 
considered to be adverbials and/or substantives (traditional grammars usually classify “who?” and 
“what?” as pronouns). Note further that 78 is classified as both a quasiverbal and an interrogative. 
Further, several items are classified as interrogatives and as pronouns. Allowing segments to be 
members of more than one part of speech is a departure from traditional taxonomy. Traditionally, 
the parts of speech are exhaustive, a constraint that we do enforce. They are also mutually exclu- 
sive, a constraint that we jettison to handle segments such as those just listed. One might avoid 
mixed categories by the expedient of simply renaming the mixed classes. For example, a new set 
(containing only one part of speech, mN) named “quasiverbal-interrogative” might be introduced. 
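
Keeping the parts of speech exhaustive while dropping mutual exclusivity can be modeled by mapping each segment to a set of categories. The Python sketch below is a hypothetical illustration: the segment keys `seg_a`, `seg_b`, `seg_c` and the helper names are placeholders and do not reflect the layout of our dictionary.

```python
from typing import Dict, List, Set

# Placeholder segments; each maps to every part of speech it may belong to.
POS_OF: Dict[str, Set[str]] = {
    "seg_a": {"quasiverbal", "interrogative"},
    "seg_b": {"interrogative", "pronoun"},
    "seg_c": {"adverb"},
}


def unclassified(segments: List[str], pos_of: Dict[str, Set[str]]) -> List[str]:
    """Exhaustiveness check: list the segments that have no part of speech."""
    return [s for s in segments if not pos_of.get(s)]


def mixed(pos_of: Dict[str, Set[str]]) -> List[str]:
    """Segments that belong to more than one part of speech."""
    return sorted(s for s, tags in pos_of.items() if len(tags) > 1)
```

Renaming a mixed class, as suggested above, would correspond to replacing a two-member set such as that of `seg_a` with a single new label; the set representation avoids multiplying labels in that way.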


3.2.3 The "Simple" Substantives 

At the bottom of the middle panel of fig. 3.2 are three kinds of pronoun. The right-most panel 
of fig. 3.2 contains 34 categories grouped into 5 supersets: proper nouns, other nouns, verbal nouns, 
quasiverbals, and verbs. In this subsection, we take up the pronoun, proper noun, and other noun 
supersets. 
3.2.3.1 The Pronouns 

Demonstrative Pronoun. We distinguish 5 demonstrative pronoun lexemes in Biblical He- 
brew. Their most frequent forms are: 


common 


| nbs | these 


Free Pronoun. The most frequent forms of the free pronouns in Biblical Hebrew are shown
in the table below.¹⁹ We observe that: (1) המה occurs both as 3rd-pers. masc. pl. (275x) and
(incorrectly, so BDB, 241) as 3rd-pers. fem. dual (3x: Zech 5:10, Ruth 1:22, Song 6:8); (2) הנה occurs
both as the 3rd-pers. fem. pl. free pronoun (47x) and as the adverb ‘here’ (51x); (3) the variant
form נחנו is both the 1st-pers. common pl. free pronoun (4x) and a suffixed sequential 1st-pers.
common pl. Qal active verb (once, in 2 Sam 17:12, ‘we will rest’; see BDB, 628a).


common 
JN I 


WIN | we 


19. Note that we use thee and thou for 2nd-person sing. throughout this work and you for 2nd-person pl. 
Bound Pronoun. Because we dissect pronoun suffixes so as to minimize the proliferation of 
suffixed forms, we obtain many bound pronoun forms. The table below tallies the number of forms 
attested for each kind of pronoun suffix. For example, there are five suffixed forms of us, and they 
are: 1x2- (1063x), 112- (377x), 13- (4x), 1x?- (168x), and 33%- (18x). 


person       masc.         fem.        common 
1st sing.                              12 me 
2nd sing.    14 thee       9 thee 
3rd sing.    13 his/him    9 her 
2nd dual                               3 you 
3rd dual                               3 them 
1st pl.                                5 us 
2nd pl.      5 you         6 you 
3rd pl.      17 them       16 them 


3.2.3.2 The Proper Nouns 

We identify 3,552 distinct proper noun forms that appear a total of 33,963 times in Biblical He- 
brew. As is the case with so many other parts of speech, their classification involves the resolution 
of considerable homography. Example: v2 


‘Yabesh’ (1x, name of man; 2 Kgs 15:10) 

‘Yabesh’ (5x, name of city; 1 Chr 10:12) 

‘it dried up’ (8x, suffixed Qal active, 3rd-pers. masc. sing.; Josh 9:12) 

‘it will dry up’ (5x, suffixed sequential Qal active, 3rd-pers. masc. sing.; Isa 19:5) 
‘dry’ (6x, adjective of quality, masc. sing.; Josh 9:5). 


Some work remains to be done on the proper nouns, in that many eponyms need to be divided into 
subsets with assignment to appropriate semantic classes. For example, N^ appears 2,506 times, 
always as the name of a human in our data. It is never classified as a place, and so on. 

3.2.3.3 The Other Nouns 

We divide the other nouns into seven subgroups: 99 ‘all’, cardinal numeral, ordinal numeral, 
adjective, ethnic, common noun, and pure noun participle. The makeup of all of these parts of 
speech, except perhaps adjective and pure noun participle, will be well known to readers (if not, 
they should consult the glossary). 

Adjective. Our definition of the category adjective is highly restrictive in one sense: with very 
few exceptions, we require that any adjective modify an immediately preceding substantive. But it 
is untraditionally inclusive in another sense: demonstrative pronouns and free pronouns, if they are 
made definite and if they modify an immediately preceding substantive, are also classified as adjec- 
tives. For example, in the noun phrase ny7 Di” n, we classify nt as an adjective. 

Pure Noun Participle. Participles are nouns. But they also have affinities with verb paradigms, 
having verbal features such as transitivity and voice. In any context where a participle exhibits only 
nominal characteristics, we call it a pure noun participle and give it noun semantics. We classify 
4,475 segment occurrences in our text as pure noun participles. For example, in Jer 4:7, in the 
clause aU? PR APN pw, the segment AW? is classified as a pure noun participle and is assigned 
human semantics, since this participle has only nominal features in this context. We will discuss our 
taxonomy of participles in greater detail in §3.2.4.2. 


3.2.4 Verbal Nouns 


The right-most panel of fig. 3.2 contains one mixed superset: verbal nouns. That is, this set of 
categories concurrently exhibits both nominal and verbal behaviors. Because of their mixed nature, 
phrase markers with these items have unique characteristics. In this subsection, we take up the 
verbal noun superset, which consists of: infinitive construct, noun-verb / noun participle, and noun- 
verb participle. 


3.2.4.1 The Infinitive Construct 

Our text contains 6,768 infinitives construct. Around nine-tenths have a prefixed preposition, 
the preposition being some form of prefixed ל around three-quarters of the time. The critical point 
is that infinitives construct have functional immediate constituents of their own. That is, they can 
be central to constructions that exhibit the obliqueness phenomenon mentioned in §1.3.3.1. Phrase 
marker (3.2) for Gen 37:25a contains an infinitive construct. 


(3.2) [Phrase marker for Gen 37:25a. Glosses, in text order: and / they(m) dwelt / to / eat / bread.] 


Note that the infinitive construct (inf cst) אכל ‘eat’ has its own direct object (dir obj) לחם ‘bread’. 
Together, these form a nominalized infinitive constituent (nom inf), licensed by obliqueness (oblq). 
The nominalized infinitive constituent combines with the preposition ל ‘to’ to form a prepositional 
phrase (to+inf cstr). Functionally, this is a CIC (clause immediate constituent) that specifies the aim 
or purpose for which ‘they(m) dwelt’ = ‘they(m) sat down’. 


3.2.4.2 The Four Kinds of Participles 

The accepted wisdom is that a participle refers to a person’s activity or a state of affairs that is 
present, either coincident (or overlapping) in time with the moment of speech or else occurring at 
the same time as some past event to which it is circumstantial. The participle is thus supposed by 
some to be the nearest thing that Classical Hebrew has to a present tense, increasingly having this 
role as the finite suffixed (perfect) and prefixed (imperfect) verbs settle down to supplying the past 
tense and future tense, respectively.20 


20. “In Biblical Hebrew the present tense is properly the domain of the predicative participle” (Jan Joosten, “The 
Predicative Participle in Biblical Hebrew,” Zeitschrift für Althebraistik 2 [1989] 128-59). 
However, scanning the categories in the right-most panel of fig. 3.2 from just above the verbal 
nouns down through just below the quasiverbals, one notices that we distinguish four functions of 
participles: pure noun participles, noun-verb participles, noun-verb/noun participles, and pure 
verb participles.21 We view the traditional participles as functionally inhomogeneous. They profit 
from subdivision.22 

We introduced the first kind of participle, the pure noun participle, at the end of §3.2.3.3. Recall 
that participles of this kind exhibit only nominal characteristics.23 

There are constructions in which a word with the form of a participle has verbal relations with 
one or more arguments that follow it, making a constituent that is a verbal clause. When a 
constituent of this sort also functions in nominal relations with a previously occurring segment, we call 
this second kind of participle a noun-verb participle. It is a “noun up-front and a verb out-back.” 
We emphasize that it is the construction as a whole, not just the participle alone, that functions 
nominally. This fact has been missed in discussions of constructions that involve determination by 
means of the definite article. In such cases, there is no need to say that the definite article is a 
relative pronoun, let alone talk about a “relative participle.”24 

The third kind of participle exhibits only verbal characteristics. We call this sort of participle a 
pure verb participle. 

There is a fourth category of participle. When a word with the form of a participle has both 
nominal and verbal relations with following segment(s) in a mixed construction, as well as some 
nominal function with preceding segment(s), then we call it a noun-[verb / noun] participle. It is a 
“noun up-front (‘noun-’), and it is a verb and noun (‘[verb/noun]’) out-back!” The example below 
should make this difficult concept clearer (see *37* as a Noun-[Verb / Noun] Participle, p. 35). 

The different kinds of participles can be appreciated by comparing phrase markers that contain 
the segment שפט, which exhibits three of the four participle behaviors. We illustrate the structures 
involved with the fourth kind of participle by showing a phrase marker with ישבי ‘dwellers of’. 

שפט as a Pure Noun Participle. In the phrase marker (3.3) from Exod 2:14, the pure noun 
participle is coordinated with a common noun, שר ‘prince’, forming a noun phrase with human 
semantics (n:humn), licensed by the union/disjunction relation (union/disj). Let us look at this in a 
bit more detail. The coordination phrase (n:humn) is in apposition to איש ‘man’, and together they 
form a larger noun phrase (n:humn) licensed by apposition (app). The resulting noun phrase 
combines with the preposition ל ‘to’ to form a prepositional phrase (to+humn) that functions as the 
object complement (obj cmp) in the clause. The clause is made up of five CICs: an open interrogative 
subject (sbj/open intg), a finite verb predicator (vb), a direct object (dir obj), an object complement 
(obj cmp), and an adjunct, location (loc), ‘upon us’. 


21. Note that the pure noun participles are classified as common nouns and the pure verb participles as verbs. 

22. For more discussion, see F. I. Andersen and A. D. Forbes, “The Participle in Biblical Hebrew and the Overlap 
of Grammar and Lexicon,” in Milk and Honey (ed. S. Malena and D. Miano; Winona Lake, IN: Eisenbrauns, 2007) 
185-212. 

23. But note that there are instances where a participle is an old-fashioned predicate adjective that can equally be 
seen to be either a pure noun or a pure verb. There is much disagreement in the literature as to whether the "predicate 
adjective" participle is the subject complement in a verbless clause or a predicator in a verbal clause. 

24. BDB recognizes a relative definite article only with perfect verbs. With participles, it calls the definite article 
"resumptive." There is no need for this category either. The definite article with a participle (construction) simply deter- 
mines it, as it does any other nominal. 
(3.3) [Phrase marker for Exod 2:14. CIC labels: sbj (open intrg), vb, dir obj, obj cmp (to + humn), loc (upon + pron). Glosses, in text order: who / he put / thee(m) / to / man / prince / and / judge / upon / us.] 


שפט as a Noun-Verb Participle. In the phrase marker (3.4) from 1 Sam 8:1, the noun-verb 
participle (n-v ptc) has its own beneficiary (benf). The clause has five CICs: a time point (tm pt), 
a sequential ו (dl seq w), a finite verb (vb), a direct object (dir obj), and an object complement 
(obj cmp). 

(3.4) [Phrase marker for 1 Sam 8:1. CIC labels include: dir obj (obj mk + humn), obj cmp (nom ptc / n-v ptc), benf (to + humn). Glosses, in text order: like / which / old / Samuel / and / he put / [nota acc.] / sons / his/him / judges / to / Israel.] 


שפט as a Pure Verb Participle. The phrase marker (3.5) from 1 Sam 3:13 shows a clause 
consisting of five CICs: a pure verb participle (vb ptc), a subject (sbj), a direct object (dir obj), a 
time aim (tm aim), and a reason (rsn). 

(3.5) [Phrase marker for 1 Sam 3:13. CIC labels: vb ptc, sbj, dir obj (obj mk + bldg), tm aim (until + time), rsn (in + abst). Legible glosses, in text order: judge / [nota acc.] / house / his/him / until / eternity / in / iniquity / which / he knew.] 



"av as a Noun-[Verb/Noun] Participle. Phrase marker (3.6) for Hos 14:8a exhibits a phe- 
nomenon not encountered heretofore. The prepositional phrase label in + thing / prep ‘in shadow his’ 
has two arrows pointing into it, one from the nom ptc/cstr label and the other from the loc/gram 
label. As a result, this phrase marker is not a tree. This is an important innovation, but we must 
defer discussing it until chap. 20, “Non-Tree Phrase Markers.” For the present purposes, we note 
that noun-[verb/noun] participles are rare, there being only 109 in our text, and that our phrase 
markers represent the syntactic relations that they enter into nicely (for a proper discussion of noun- 
[verb / noun] participles, see §20.2.1). 


(3.6) [Phrase marker for Hos 14:8a. Labels include: nom ptc / cstr, loc (in + thing) / gram prep, n:thing / suff. Glosses, in text order: dwelling of / in / shadow / his/him.] 

3.2.5 The Quasiverbals 
We group these five lexemes in a set that we term the quasiverbals:25 
mx where? TN does not exist nin behold w exists Tip still 


The literature rarely refers to quasiverbals. Waltke and O'Connor do state that “[q]uasi-verbal 
indicators are particles indicating existence.”26 In chap. 18, we argue for grouping these items. 


25. F. I. Andersen, “Lo and Behold! Taxonomy and Translation of Biblical Hebrew הנה,” in Hamlet on a Hill: 
Semitic and Greek Studies Presented to Professor T. Muraoka on the Occasion of His Sixty-Fifth Birthday (ed. M. F. J. 
Baasten and W. T. van Peursen; Leuven: Peeters, 2003) 53. 

26. IBHS, 72. 
Phrase marker (3.7) from Job 9:33 shows an interesting simple clause wherein the existential 
quasiverbal יש ‘exists’ is the predicator. The clause negates יש with ‘not’ rather than using אין ‘does not exist’. 


(3.7) [Phrase marker for Job 9:33. Glosses, in text order: not / exists / between / us / reprover.] 


3.2.6 Verbs 


At the bottom of the right-most panel of fig. 3.2 are 12 categories of verbs. These categories 
are for the most part quite traditional. Only two are not poly-something (polytense, 
polyaspect): wayyiqtol (always sequential preterite and indicative) and imperative (always 
injunctive). An entry for each verb category may be found in the glossary. 
3.2.6.1 The Three Functions of the Infinitive Absolute 

Our text contains 860 infinitives absolute. Grammars differ widely in their treatment of the 
syntax of the infinitive absolute. Joüon / Muraoka provide a detailed analysis, recognizing 9 
differing functions for the infinitive absolute.27 We distinguish 3 functions (predicator, intensifier, and 
amplifier) and identify each function as a particular CIC. 

Infinitive Absolute Predicator. When a clause has no predicator other than an infinitive 
absolute, we call the infinitive absolute (IA) an infinitive absolute predicator (inf abs prd) CIC. Phrase 
marker (3.8) for Josh 6:3 is an example of this circumstance. This clause has four CICs: the subject 
CIC is a construct chain having human semantics, the predicator CIC is the infinitive absolute, 
the direct object CIC involves geographic semantics, and there is an adjunct CIC that specifies the 
number of times. 

(3.8) [Phrase marker for Josh 6:3. CIC labels: sbj, inf abs prd, dir obj (n:geog), # times (n:qty). Glosses, in text order: all of / men of / the / war / to go around / [nota acc.] / the / city / occurrence / one.] 

27. Joüon and Muraoka, A Grammar of Biblical Hebrew, 390–401. 

Infinitive Absolute Intensifier. When a clause has a finite predicator and also has an infinitive 
absolute having the same “root,” then we call the latter an infinitive absolute intensifier (inf abs int). 
Phrase marker (3.9) for 1 Sam 22:16 illustrates this phenomenon. 


(3.9) [Phrase marker for 1 Sam 22:16. CIC labels: inf abs int, vb, voctv, sbj. Glosses, in text order: to die / thou(m) wilt die / Ahimelek / thou(m) / and / all of / house of / father / thee(m).] 


The CICs are the infinitive absolute intensifier, the predicator, a vocative, and the subject. 

Infinitive Absolute Amplifier. When a clause has a finite predicator and also an infinitive abso- 
lute but with a different "root," then we call the latter an infinitive absolute amplifier (inf abs amp). 
The three-CIC phrase marker (3.10) for Deut 9:12 nicely illustrates this phenomenon. 


(3.10) [Phrase marker for Deut 9:12. Glosses, in text order: they turned aside / to hurry / from / the / path / which / I commanded / them(m).] 


Note that the embedded nominalized clause which is part of the movement origin (mvt orig) CIC is 
in apposition to ‘the path’. 
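
The three-way distinction just described reduces to a small decision rule. The following is a minimal sketch only, not the authors' implementation: the function name and its arguments are hypothetical, English glosses stand in for Hebrew roots, and it assumes that any predicator other than the infinitive absolute is a finite verb.

```python
from typing import Optional


def infinitive_absolute_function(ia_root: str, finite_root: Optional[str]) -> str:
    """Return the CIC label for an infinitive absolute (IA).

    ia_root     -- root of the infinitive absolute (hypothetical input)
    finite_root -- root of the clause's finite predicator, or None when
                   the IA is the only predicator in the clause
    """
    if finite_root is None:
        return "inf abs prd"   # predicator: no other predicator present
    if finite_root == ia_root:
        return "inf abs int"   # intensifier: same root as the finite verb
    return "inf abs amp"       # amplifier: different root


# English glosses stand in for the Hebrew roots of the three examples.
print(infinitive_absolute_function("go around", None))       # Josh 6:3
print(infinitive_absolute_function("die", "die"))            # 1 Sam 22:16
print(infinitive_absolute_function("hurry", "turn aside"))   # Deut 9:12
```

The rule needs only root identity, which is why the same "root" wording appears in both the intensifier and the amplifier definitions.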
3.2.6.2 The Status of Verb Markup 

This inventory of verb groups is a first approximation. There is, to touch on just one illustrative 
point, a category of “jussive” based in the first place on a morphological distinction (jussives are 
shorter than normal indicatives). But many prefixed forms are homographs (jussive or indicative), 
and so there are many more jussives that need to be identified if the category of jussive as a mood 
is to be of any use. We have proceeded to resolve the ambiguity. 

Table 3.2. The Semantic Codes of Biblical Hebrew 

    +   particle 
    S   spatial and directional 
    T   time or season 

  Semantics of SUBSTANTIVES 
    A   appliance or furniture 
    B   building (part / structure) 
    C   creature (bird / insect / fish) 
    D   deity 
    E   event 
    F   food 
    G   geographical name or feature 
    H   human (incl. ethnic, community group) 
    J   jigger (thing) 
    K   color 
    L   livery (raiment, garment, apparel, jewelry) 
    M   medical (health term) 
    N   natural substance (incl. light / darkness, weather) 
    O   occipital (mental) state 
    P   part (body part, fluid, or excretion) 
    Q   noun of quality 
    S   spatial and directional 
    T   time or season 
    U   utensil (implement / relic / weapon / textile / vehicle) 
    V   vegetation (plant or part, less edibles) 
    W   work (activity or action) 
    X   document 
    Y   war term (less humans and weapons) 
    Z   abstract concept 
    #   quantity (numeral / weight / measure / size) 
    $   riches (money / transaction / property) 
    !   vocable (noun) 
    x   undecidable / unknown 

  Some Characteristics of VERBS 
    =   existential (be plus most quasiverbals) 
    a   verb of attitude (emotional or moral state) 
    d   verb of destruction 
    j   transitive verb 
    k   intransitive verb 
    l   [temporal locative: 5x] 
    m   verb of movement 
    o   ditransitive Hiphil 
    p   passive verb 
    r   caused motion 
    s   stative verb 
    u   verb of utterance, less אמר 
    y   say [אמר] 
    z   estimative [Xp] 


The assignment of tense and mood—not to mention aspect—is “in process,” acceptable as far 
as it goes in our database, but incomplete. Both tense and aspect are compositional on the clause 
level, so that tense and aspect are features of clauses as such. That is, the tense and aspect of a 
clause are determined not only by the characteristics of its verb but also by relations between it and 
other clauses in the discourse of which it is part. Hence, there is no point in artificially assigning 
tenses and aspects to the verbs as such, just because people used to think (and traditionalists still 
think) that verbs, not clauses, have tense and aspect. 


3.3 Part-of-Speech Features 


Grammars assign descriptive features to each segment making up a text. This enables compu- 
tational parsing, facilitates word-sense assignment, accelerates translation, and aids interpretation. 

In the informal grammars presupposed by the traditional reference works, the feature appara- 
tus is extremely simple. Each particle is featureless; it simply is the part of speech that it is. Each 
substantive is specified by fixing its number, gender, and state. Each verbal is specified by fixing 
its binyan, mood, number, gender, and person. To these traditional features, we add a simplified 
specification of semantic category where appropriate. 


3.3.1 Marking Semantics 

The semantics feature was added to our data in the mid-1980s to assist computer parsing. The 
categories arose willy-nilly, and no effort has yet been made to make them mutually exclusive. We 
leave our ancient alphabetic mnemonics in table 3.2 to show the “symbol poverty” under which we 
labored! When we assigned the semantic codes, principled taxonomies were beyond our ken. The 
introduction of enriched, even multivalued semantic labels is one of our (too-populated) priorities. 


3.3.1.1 Semantics of Particles 

A few prepositions were assigned spatial semantic values: 3 ‘between’, anm ‘after’, DUN ‘near’, 
223p ‘before’, in ‘inside’, and nnnm ‘under(neath)’. In addition, four subordinating conjunctions 
received temporal semantics: NN ‘after’, 0103 ‘before’, DW ‘before’, and 184 ‘since’. 
3.3.1.2 Semantics of Adverbials 

The adverbs listed in §3.2.2.2 are given either no semantic label (“adverbs of manner”28): 


minx backward NinN then (logical) Yt together TRA very 
TN only om silently n3 thus ona suddenly 

DINN truly wn quickly "3 very Dp" in vain 

DJON truly Din freely Nb certainly pa only 


or a temporal label (and are termed “adverbs of time”): 


DN afterwards DW not yet ^33 already TTD yet TV again 


TN then on? byday MWM quickly Tip still nny now 
or a spatial label (and are termed “adverbs of space”): 


"inm back Don here T3» apart >va above m4 here maw thither 


nN on farther | nàn here nen below myn upward DU there nnn underneath 


3.3.1.3 Areas for Future Improvement 
With regard to semantics, four areas remain for improvement: 


1. The characteristics of verbs are crudely defined and crudely implemented. 

2. Forcing the semantics feature to have one value is too simple. For example, a golden 
branch should be vegetation (for the branch) and natural substance (for the gold). 

3. We have not worked out and applied a theory of figurative /metaphorical language. 

4. Reliable work on semantic fields should be used to refine the semantic categories. 
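
Item 2 can be made concrete with a small sketch. It is illustrative only: the dictionary and function names are hypothetical, and only the two codes needed for the golden-branch example are taken from table 3.2. The single-valued scheme forces a choice between codes, whereas a set-valued label records both.

```python
# Two semantic codes from table 3.2 (only those needed for the example).
SEMANTIC_CODES = {
    "V": "vegetation",
    "N": "natural substance",
}

# Single-valued labelling: one code per item, so the gold is lost.
single_valued = {"golden branch": "V"}

# Multivalued labelling (hypothetical): a set of codes per item.
multi_valued = {"golden branch": frozenset({"V", "N"})}


def describe(codes):
    """Spell out a collection of semantic codes in alphabetical order."""
    return sorted(SEMANTIC_CODES[code] for code in codes)


print(describe(single_valued["golden branch"]))   # ['vegetation']
print(describe(multi_valued["golden branch"]))    # ['natural substance', 'vegetation']
```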


28. We do not distinguish a “degree modifier” part of speech, consisting in Biblical Hebrew of %3 ‘very’ and IRA 
‘very’. We include these words with the adverbs of manner. See Trask, A Dictionary, 74. 
3.3.2 The Assigned Features 


3.3.2.1 Features of Particles and Adverbials 

The parts of speech in supersets particles and adverbials are featureless, except for those that 
have a semantic class assigned as described above. 
3.3.2.2 Features of Substantives 

The features of pronouns are: 


| Number | S = singular | D = dual | P = plural 


| Gender | C = common | F = feminine | M = masculine 


For free and suffix pronouns, we also give person information: 1st, 2nd, 3rd. For demonstratives, 
we give state information of the sort shown below. 
The features of proper and common nouns:29 


| Number | S = singular | D = dual | P = plural | C = collective | d = distributive 


| Gender | C = common | F = feminine | M = masculine 


There are 12 possible state combinations: 


indefinite 71,007 3,345 30,374 24,676 
definite with ^1 21,177 1,685 22 6 
definite with * " 5,781 554 4 Isa 24:2 


3.3.2.3 Features of Verbal Nouns 
For the verbal nouns, we always present the binyan and verbal characteristics. This is all that 
we provide for the infinitives absolute. 


The infinitives construct. We also give state information, mostly construct or suffixed. 

The noun-verb/noun participles. All of the instances (“tokens”) are in the construct state. 
All but two are masculine. There are 67 plurals and 38 singulars. 

The noun-verb participles. We specify the following information: 


Number S = singular P = plural 
Gender F = feminine M = masculine 
State as in the extensive specification above 


29. There are neither collective nor distributive proper nouns. 
3.3.2.4 Features of Verbs 
For the verbs, we supply their binyanim and their verbal semantic / valency information. We 
also specify the following information: 


| Number | S = singular | D = dual (3x) | P = plural 


| Gender | F = feminine | M = masculine 


| Extended person | 2nd | 3rd | definite verb 


The phenomenon of the definite verb (also called the nominalized verb) occurs 52 times. An ex- 
ample is NNa’ ‘the-she-came’ in Job 2:11 in this context: 


YD nN2'n nNí'n ATTY n8 IPR y? NW yu 


The noun phrase part of the direct object of this clause is shown in phrase marker (3.11). 


(3.11) [Phrase marker for the noun phrase in the direct object of Job 2:11. Labels include: n:qual / app, n:qual / cstr, adj, nom / def, nom cl, vb, mvt aim (upon + pron). Legible glosses, in text order: all of / the / evil / the / this / the / she came / upon.] 


The NJPS translates the direct object as “all these calamities that had befallen him.” 


3.3.3 Roots versus Identifiers 


Each segment type has an associated identifier akin to but sometimes different from the 
traditional root. In our database, the segments that make up a dictionary entry are identified by a 
conventional root for verbs or by the consonantal framework for nonverbs. The identifier is thus the 
lattice of consonants from which the various forms making up an entry can be derived. In 
specifying the identifier, we ignore matres lectionis that are only occasionally used as vowel letters in a 
lexeme. Thus, for example, the plene form כול ‘all of’ is attested once (as Kethiv in Jer 33:8), while 
the defective כל form occurs hundreds of times. Our identifier for this lexeme is therefore כל, not 
כול. Our approach to lemmatization is strongly descriptive. In this, we are much closer to HALOT 
than to BDB.30 For example, we are less traditional and more descriptive in that we recognize 


30. See the introduction to F. I. Andersen and A. D. Forbes, The Vocabulary of the Old Testament (Rome: Pontifical 
Biblical Institute, 1992). 
biconsonantal roots. Thus, we use קם rather than קום as the identifier of verbs meaning ‘arise’, since 
no word in Biblical Hebrew derived from this root contains the consonant ו as such.31 


3.4 Brief Summary 


Top Levels of the Part-of-Speech Hierarchy. Concordant with modern linguistics, our parts 
of speech are organized hierarchically (fig. 3.1). Our 7 major categories are largely congruent with 
those invoked by traditional grammars of Biblical Hebrew. Subdividing these 7 major grammatical 
categories, we recognize 15 categories (fig. 3.1). We diverge from tradition mainly in: (1) form- 
ing the category of quasiverbals from lexemes classified by others as adverbials and locating 
them among the verbals; and (2) distinguishing closed interrogatives from the very different open 
interrogatives. 

Lower Levels of the Part-of-Speech Hierarchy. We then subdivide several additional catego- 
ries, to obtain a list of 37 subcategories (table 3.1). Eight of these are subdivisions of nouns (such 
as proper nouns, ethnics, etc.), and 10 are subdivisions of finite verbs (such as insistent imperatives, 
suffixed sequentials, etc.). An innovation is our splitting of participles into 4 subtypes: pure noun, 
noun-verb, noun-verb/noun, and pure verb. When we subdivide even more categories, we finally 
obtain 76 parts of speech (POSs) (fig. 3.2). These parts of speech consist of POSs that are: 


* individual lexemes (individual prepositions and conjunctions), 

* subsets of traditional POSs (such as nouns [by semantics], verbs [by morphology], and 
pronouns [free, bound, demonstrative]), 

* categories not usually recognized as POSs (such as modals, exclamatives, quasiverbals, 
closed and open interrogatives, nominalizers [distinguished on syntactic grounds]), 

* familiar categories (adverbs, adjectives [albeit with an untraditional definition]). 


Part-of-Speech Features. Tokens of the parts of speech are marked with features standard to 
the description of Biblical Hebrew: number, gender, person, state, and so forth. In addition, we 
mark substantives with naive semantic category labels such as appliance, building, creature, deity, 
event. 


31. Technical detail: When a verb has more than one (consonantal) alloroot, due to “irregularities” of various kinds 
in the paradigms, we choose the complete one or the most common one, except for verbs that are historically third yod, 
where the conventional “root” in dictionaries has a nonconsonantal he? in third position. 


Chapter 4 


Phrase Marker Concepts 
and Terminology 


At this point, we need to introduce a few standard ways of describing phrase markers and 
to specify how information propagates among and within them. Although the basic concepts are 
simple, the nomenclature will be new to many readers. To assist readers in mastering the terminol- 
ogy, we shall offer multiple occasions to exercise the concepts and terminology in upcoming sec- 
tions of this book. Readers are reminded that definitions of words and phrases underlined in the text 
(e.g., root node) may be found in the glossary at the back of the book. 


4.1 Phrase Markers Defined 


A phrase marker is a two-dimensional diagram that displays both the internal hierarchical struc- 
ture and sequential structure of a clause and its constituents. A phrase marker consists of a set 
of labeled nodes and the unlabeled directed edges (“arrows”) connecting these nodes. The edges 
show the dominance relations between constituents, pointing from dominating node to dominated 
node. The precedence relations are implicit, since the (vertical) sequence of the terminal segments 
is in the same order as the text in the Bible. The concepts underlying phrase markers can best be 
understood through a simple example. 

Phrase marker (4.1) for the first clause in Ruth 2:12 consists of eight edges and nine nodes. 
Recall that the glosses at the far right of the diagram are supplied as an assistance to readers. They 
are not part of the phrase marker and thus are not nodes. 


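(4.1) [Phrase marker for the first clause in Ruth 2:12. Root node cl/oblq with daughters vb/gram, sbj/gram, and dir obj/gram. Glosses, in text order: let him recompense / Yahweh / deed / thee(f).] 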


The labels on nodes, other than the nodes that contain the text segments, are bipartite. Each node 
label consists of a construction / constituent identifier and a relation identifier. In displayed phrase 
markers such as (4.1), the former identifier is spatially above the latter. To save space in running 
text, we put the construction / constituent identifier and the relation identifier on the same line and 
separate them by a slash. Hence, the left-most node in the phrase marker above would be presented 
in text as cl / oblq. 
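
As an illustration of the bipartite labels, phrase marker (4.1) can be written down as a small data structure. This is a minimal sketch under stated assumptions: the class and function names are hypothetical, leaves are represented by their English glosses rather than the Hebrew segments, and the label `n/suff` for the node between dir obj/gram and its two segments is a placeholder, since that label is not named in the running text.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    construction: str                # upper part of the label (or the gloss, for a leaf)
    relation: Optional[str] = None   # lower part of the label; None for text segments
    children: List["Node"] = field(default_factory=list)

    def label(self) -> str:
        """Running-text form of the label: construction/relation."""
        if self.relation is None:
            return self.construction
        return f"{self.construction}/{self.relation}"


def walk(node):
    """Yield a node and everything it dominates, in text order."""
    yield node
    for child in node.children:
        yield from walk(child)


root = Node("cl", "oblq", [
    Node("vb", "gram", [Node("let him recompense")]),
    Node("sbj", "gram", [Node("Yahweh")]),
    Node("dir obj", "gram", [
        Node("n", "suff", [Node("deed"), Node("thee(f)")]),  # placeholder label
    ]),
])

nodes = list(walk(root))
edges = sum(len(n.children) for n in nodes)
print(root.label(), len(nodes), edges)   # cl/oblq 9 8
```

The counts agree with the nine nodes and eight edges stated for (4.1).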
A node at the left end of an edge is said to be the mother or parent of a node at the right end, 
the right node being termed the daughter or child. In phrase marker (4.1), the node cl/oblq is the 
parent (mother) of three children (daughters): vb/gram, sbj/gram, and dir obj / gram. 

With regard to the hierarchical relationship between parent and child, the parent is said to domi- 
nate the child immediately. The relation between the nodes is said to be one of dominance. In our 
example, cl/oblq immediately dominates vb/gram, sbj/gram, and dir obj/gram. But it also domi- 
nates all of the other nodes in the phrase marker. A node that dominates all of the nodes is said to 
be the root node or root of the phrase marker. Nodes that have no children—that is, that dominate 
no other nodes—are called the leaves of the phrase marker. The leaves are the text segments mak- 
ing up the clause that the phrase marker describes. 

With regard to the sequential structure of the phrase marker, nodes having the same parent are 
said to be siblings or sisters. A node that comes earlier in a clause than its sister is said to pre- 
cede its sister. The relation between the nodes is one of precedence. In the phrase marker above, 
vb/gram precedes sbj/gram which precedes dir obj/gram. 
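
The dominance and precedence relations can be sketched over a plain mother-to-daughters table for phrase marker (4.1). The names below are hypothetical, leaves appear as English glosses, and `n/suff` is again a placeholder label for the node above the last two segments.

```python
# Mother -> ordered daughters for phrase marker (4.1).
CHILDREN = {
    "cl/oblq": ["vb/gram", "sbj/gram", "dir obj/gram"],
    "vb/gram": ["let him recompense"],
    "sbj/gram": ["Yahweh"],
    "dir obj/gram": ["n/suff"],          # placeholder label
    "n/suff": ["deed", "thee(f)"],
}


def parent_of(node):
    """Return the mother of a node, or None for the root."""
    for mother, daughters in CHILDREN.items():
        if node in daughters:
            return mother
    return None


def dominates(a, b):
    """True if a dominates b, immediately or through intermediate nodes."""
    daughters = CHILDREN.get(a, [])
    return b in daughters or any(dominates(d, b) for d in daughters)


def precedes(a, b):
    """Precedence between sisters: a comes earlier than b under one mother."""
    mother = parent_of(a)
    if a == b or mother is None or mother != parent_of(b):
        return False
    daughters = CHILDREN[mother]
    return daughters.index(a) < daughters.index(b)


root = [n for n in CHILDREN if parent_of(n) is None]
leaves = [d for ds in CHILDREN.values() for d in ds if d not in CHILDREN]
print(root, leaves)
print(dominates("cl/oblq", "thee(f)"), precedes("vb/gram", "sbj/gram"))
```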

The upper (or first) part of a node label identifies the structure immediately dominated by this 
node. There are two kinds of construction / constituent identifiers in our phrase markers. One identifies 
major grammatical functions, and the other specifies the structure of the constituents. 

The lower (or second) part of the node label specifies the grammatical relation that licenses (or, 
justifies) the formation of the structure that its node dominates. 


4.2 Phrase Markers Characterized 


The number of edges that enter a node is called the node's in-degree. The in-degree of a root 
node is always zero. The in-degree of all of the other nodes is one.¹ The number of edges that leave 
a node is its out-degree. In example (4.1), the out-degree of the root is three, the out-degree of the 
text segments is zero, and so on. 

There are two very simple ways to characterize the complexity of a phrase marker. The phrase 
marker's length is the number of Hebrew segments that it contains. The length of phrase marker 
(4.1) is four. The phrase marker's depth is the maximal number of generations encountered in pass- 
ing from the root node (“generation 1”) to the leaves (“generation N”). In example (4.1), the depth 
is four. 
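
These four measures are easy to compute from a list of edges. The sketch below is illustrative only: the edge list is a hypothetical tree chosen to match the length and depth stated for (4.1), and the node names "phrase" and "seg1" to "seg4" are placeholders, not labels from our data.

```python
# Each edge runs from a mother to a daughter. Hypothetical stand-in for (4.1).
EDGES = [
    ("cl/oblq", "vb/gram"), ("cl/oblq", "sbj/gram"), ("cl/oblq", "dir obj/gram"),
    ("vb/gram", "seg1"), ("sbj/gram", "seg2"),
    ("dir obj/gram", "phrase"), ("phrase", "seg3"), ("phrase", "seg4"),
]

children = {}
in_degree = {}
for mother, daughter in EDGES:
    children.setdefault(mother, []).append(daughter)
    in_degree[daughter] = in_degree.get(daughter, 0) + 1

nodes = {n for edge in EDGES for n in edge}
roots = [n for n in nodes if in_degree.get(n, 0) == 0]


def out_degree(node):
    """Number of edges leaving the node."""
    return len(children.get(node, []))


def depth(node):
    """Generations from this node (generation 1) down to its deepest leaf."""
    return 1 + max((depth(c) for c in children.get(node, [])), default=0)


# Length: the number of leaves, that is, of text segments.
length = sum(1 for n in nodes if out_degree(n) == 0)
```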

Until we are forced to generalize them later in this book, our phrase markers will be labeled, 
directed trees. Being a tree, any phrase marker obeys these four conditions:² 


1. Single-Root Condition. There is only one root node. 
2. Single-Mother Condition. Any non-root node has only one mother. 
3. Non-tangling Condition. Edges may not cross. 
4. Exclusivity Condition. For any two nodes in a tree, either: 
a. one dominates the other 
or 
b. one precedes the other. 


1. In chap. 20, we study phenomena for which this is no longer true. 
2. Adapted from Robert Trask, A Dictionary of Grammatical Terms in Linguistics (London: Routledge, 1993) 285. 
Trees have interesting properties.³ For example, the number of edges is always one less than the 
number of nodes. Also, between any two nodes, there is always only one path. 
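
Some of these conditions and properties can be checked mechanically. The following sketch, with the hypothetical function name `check_tree`, tests the single-root condition, the single-mother condition, and the edge count. These checks are necessary but not sufficient for being a tree, and the sketch does not attempt the non-tangling or exclusivity conditions, which need the left-to-right order of the segments.

```python
def check_tree(edges):
    """Report three necessary properties of a tree given (mother, daughter) edges."""
    nodes = {n for edge in edges for n in edge}
    mothers = {}
    for mother, daughter in edges:
        mothers.setdefault(daughter, []).append(mother)
    roots = [n for n in nodes if n not in mothers]
    return {
        "single_root": len(roots) == 1,
        "single_mother": all(len(m) == 1 for m in mothers.values()),
        "edges_one_less_than_nodes": len(edges) == len(nodes) - 1,
    }


# Hypothetical example: a small tree, then the same graph with one extra edge
# that gives "seg3" a second mother.
tree_edges = [
    ("cl/oblq", "vb/gram"), ("cl/oblq", "sbj/gram"), ("cl/oblq", "dir obj/gram"),
    ("vb/gram", "seg1"), ("sbj/gram", "seg2"),
    ("dir obj/gram", "phrase"), ("phrase", "seg3"), ("phrase", "seg4"),
]
non_tree_edges = tree_edges + [("sbj/gram", "seg3")]

tree_report = check_tree(tree_edges)
non_tree_report = check_tree(non_tree_edges)
```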


4.3 Information Propagation among and within Phrase Markers 


The set of phrase markers shown in this book (and made available by Logos Research Systems) 
has all instances of lexical and structural ambiguity resolved. We do this to produce what we con- 
sider the single most-salient parsing of each clause. Usually this policy leads to uncontroversial 
results. But occasionally it eliminates important, often seemingly intentional ambiguity. We plan to 
restore and represent ambiguity in later releases of our data. 

In this subsection, we will: 


* discuss our conventions for resolving ambiguity by propagating information among and 
within our phrase markers and by relying on world knowledge. 
* specify our rules for promoting semantic information within our phrase markers. 


4.3.1 Information Sharing among Phrase Markers 
What should the gloss on אילים in phrase marker (4.2) from Job 42:8 be? Is ‘rams’ correct? 


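(4.2) [Phrase marker for Job 42:8; the tree layout did not survive extraction. Recoverable 
constituents: vb ‘you(m) take’; benf/gram dominating to+pron/prep (‘to’ + ‘you(m)’); a 
direct-object union phrase conjoining ‘seven’ + ‘bulls’ with ‘seven’ + the final segment 
under discussion.] 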


When we encounter the final segment, we consult a lexicon to discover its available semantics. In 
BDB, we find five entries (pp. 17-18 and 42), each with its own distinctive semantics: 


1. ‘ram’ (semantics: anml) . . . e.g., Isa 1:11 
2. ‘projecting pillar’ or ‘pilaster’ (semantics: bldg) . . . e.g., Ezek 40:14 
3. ‘leader, chief’ (semantics: humn) . . . e.g., Job 41:17 
4. ‘terebinth’ (semantics: veg) . . . e.g., Isa 1:29 
5. ‘gods’ (semantics: deity) . . . e.g., Exod 15:11 


When an analyst selects the semantic category for אילים here, it is very easy to be oblivious to the 
thought processes and knowledge that constrain the selection. Here, one might proceed as follows: 


3. Nora Hartsfield and Gerhard Ringel, Pearls in Graph Theory: A Comprehensive Introduction (San Diego: Aca- 
demic, 1994) 17—20. 




1. Since the first conjunct in the union phrase making up the direct object has animal 
semantics, we might decide that the second conjunct should have the same semantics.⁴ 

2. We might read ahead two clauses and learn that a burnt offering is to be made. Given this 
fact, we might infer that the אילים are to be part of the offering. Knowing what we know 
about burnt offerings, we might thereby rule out all semantic categories but the first.⁵ In 
this case, information from a nearby phrase marker plus knowledge of Israelite sacrifices 
might allow us to make a convincing assignment. 


Were we determined initially to work clause-by-clause and then advance higher into discourse, 
then we might either leave the semantics completely unspecified, or we might under-specify the 
semantics in a list: Sem = {anml, bldg, humn, veg, deity}. Instead, we have chosen to consider all 
available information so as to allow us to select a single value for each semantic feature. 
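
The contrast between under-specifying and resolving can be pictured as narrowing a set of candidates. This is a sketch under stated assumptions: the function `narrow` and the constraint set `BURNT_OFFERING_VICTIM` are hypothetical devices for the illustration and do not describe how our data were actually prepared.

```python
# The five lexicon categories available for the final segment of (4.2).
CANDIDATES = {"anml", "bldg", "humn", "veg", "deity"}

# Hypothetical encoding of world knowledge: what a burnt offering may consist of.
BURNT_OFFERING_VICTIM = {"anml"}


def narrow(candidates, *constraints):
    """Intersect the candidate set with each constraint in turn."""
    result = set(candidates)
    for allowed in constraints:
        result &= allowed
    return result


under_specified = narrow(CANDIDATES)                      # clause-by-clause view
resolved = narrow(CANDIDATES, BURNT_OFFERING_VICTIM)      # with nearby context
```

With no constraints the full list survives, which corresponds to the under-specified option; with the constraint drawn from the later clause, only one value remains.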


4.3.2 Semantic Feature Value Propagation within Phrase Markers 
Selecting a single value for each semantic feature enables us to enhance our phrase markers 
by representing the semantics of the syntactic constituents. But to do so, we must specify rules for 
propagating semantic information in phrase markers. We rely on this modified recursive rule: 
Basic Recursive Rule. The semantic feature value for a substantive constituent is the value of 
its first immediate constituent with the following exceptions: 


1. Definite Phrase Exception. In a definite phrase, the semantics of the phrase is that of the 
determined substantive (the second immediate constituent). 

2. Inverted Modification Exception. In an inverted modification phrase, the phrase semantics 
are those of the modified constituent (the second immediate constituent). 

3. Construct-Chain Exception. In a construct chain having one or more nomina regens with 
quantity semantics, the semantics of the final nomen rectum are promoted to the level of 
the earliest quantity nomen regens. 

4. Construct-Chain Patch. In a construct chain, when a nomen regens specifying lineage 
(e.g., “son of,” “firstborn of,” etc.) and having human semantics precedes a nomen rectum 
with animal semantics, the construct phrase gets animal semantics. 
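
The rule and its exceptions can be sketched as a small recursive function. Everything named here is an assumption made for illustration: the class `Constituent`, the `lineage` flag, and the label `qty` for quantity semantics. For a quantity nomen regens the sketch simply takes the semantics computed for its nomen rectum, which agrees with the Construct-Chain Exception when every regens in the chain is a quantity, as in (4.6); chains that mix quantity and non-quantity regens are not modeled.

```python
from dataclasses import dataclass, field
from typing import List, Optional

QTY = "qty"  # hypothetical label for quantity semantics


@dataclass
class Constituent:
    gloss: str = ""
    sem: Optional[str] = None      # lexical semantics of a segment
    rel: Optional[str] = None      # licensing relation of a phrase
    kids: List["Constituent"] = field(default_factory=list)
    lineage: bool = False          # regens such as 'son of', 'firstborn of'


def semantics(c: Constituent) -> Optional[str]:
    if not c.kids:                                 # a segment: use the lexicon value
        return c.sem
    first, second = c.kids[0], c.kids[1]
    if c.rel in ("def", "inv mod"):                # exceptions 1 and 2
        return semantics(second)
    if c.rel == "cstr":
        if semantics(first) == QTY:                # exception 3 (simplified)
            return semantics(second)
        if first.lineage and semantics(first) == "humn" and semantics(second) == "anml":
            return "anml"                          # the construct-chain patch
    return semantics(first)                        # basic recursive rule


def seg(gloss, sem=None, lineage=False):
    return Constituent(gloss=gloss, sem=sem, lineage=lineage)


def phrase(rel, *kids):
    return Constituent(rel=rel, kids=list(kids))


days_of_abraham = phrase("cstr", seg("days of", "time"),
                         phrase("app", seg("Abraham", "humn"),
                                phrase("suff", seg("father", "humn"), seg("his/him"))))
the_people = phrase("def", seg("the"), seg("people", "humn"))
seven_bulls = phrase("inv mod", seg("seven", QTY), seg("bulls", "anml"))
multitude = phrase("cstr", seg("multitude of", QTY),
                   phrase("cstr", seg("all of", QTY), seg("wealth", "val")))
firstborn = phrase("cstr", seg("firstborn of", "humn", lineage=True), seg("donkey", "anml"))
```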


An example of each of the parts of this rule may be helpful. 
Basic Recursive Rule. Illustrative of the basic recursive rule is the constituent from phrase 
marker (4.3) for Gen 26:15. 


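(4.3) [Phrase marker fragment for Gen 26:15; tree layout lost in extraction. Recoverable 
structure: ‘days of’ followed by an apposition phrase n:humn/app that joins ‘Abraham’ with 
an n:humn phrase made up of ‘father’ + ‘his/him’.] 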


4. This inferential leap is on shaky ground. Heterogeneous semantics is common in union phrases. In three-segment 
union phrases (“X and Y”), the semantics of the conjuncts differ almost a third of the time. Consider these few instances 
of semantic heterogeneity in union phrases that involve animals: Gen 26:14 (animals and slaves), Exod 14:9 (animals 
and humans), Zeph 1:3 (people and animals), and 1 Chr 5:21 (animals and animals and animals and humans). 

5. This excludes an unfolding tale wherein sacrificial bulls and participating chiefs/leaders are to be taken to Job. 
We assign human semantics both to אביו and to אברהם אביו. The semantics of the complete noun 
phrase, ימי אברהם אביו, is time. 
Definite Phrase Exception. Consider fragment (4.4) from the phrase marker for Neh 8:7. 


(4.4) [n:humn/def: ‘the’ + ‘people’] 


The definite noun phrase has human semantics (humn), promoted from below. 
Inverted-Modification Exception. Phrase marker (4.5) from Job 42:8 shows inverted 
modification. 


(4.5) [n:anml/inv mod: ‘seven’ + ‘bulls’] 


The constituent has the semantics of its second immediate constituent (פרים), namely anml. 

Construct-Chain Exception. The construct chain in (4.6) from Ezek 27:12 illustrates the 
exception. Both רב and כל have quantity semantics, so the semantics of the complete chain (val = 
valuable) is that of the final nomen rectum (הון). 


(4.6) [n:val/cstr: ‘multitude of’ + n:val/cstr (‘all of’ + ‘wealth’)] 


Construct-Chain Patch. This patch is necessitated by the limited granularity of our semantics 
categories. We have no category that covers living beings, be they human or animal. Hence, in a 
constituent such as פטר חמור in (4.7) from Exod 34:20, we promote the nomen rectum semantics 
so the noun phrase has animal semantics, thereby avoiding the problem of a noun phrase wherein 
the firstborn of a donkey would have human semantics (that being the semantics assigned to the 
lexeme glossed ‘firstborn of’). 


(4.7) [n:anml/cstr: ‘firstborn of’ + ‘donkey’] 


4.4 The Phrase Marker Creation Process 


Our phrase markers were generated in three steps, with the second and third steps ongoing: 


1. Generate phrase markers by designing, implementing, and using a battery of computational 
parsers (computer parsing).⁶ 


6. Our parsing strategy is covered in considerable detail in F. I. Andersen and A. D. Forbes, “Opportune Parsing: 
Clause Analysis of Deuteronomy 8,” in Bible and Computer: Desk and Discipline (Paris: Honoré Champion, 1995) 
49-75. 
2. Extend the phrase markers beyond simple trees and detect/correct parsing errors by 
repeatedly reading through the phrase markers and by checking instances where the parsing 
rules were known to be under- or over-productive (human over-reader correcting). 

3. Make the phrase markers consistent via various strategies (consistency checking). 


Phrase marker (4.8) from the fable in Judg 9:10 will help us illustrate the process. All of the phrase 
marker outside the three boxes was correctly generated by the computer parser. The over-reader 
then assembled the speech into a sentence and made it the daughter of the obj addr node, as shown 
in the large dotted box. Since trees do not talk, the parser did not identify “the trees” as the subject 
of “they said.” The over-reader recognized that the until-then-unassigned node label in the upper 
dotted box is subject and made the assignment. The parser erroneously assigned the label in the 
solid-boxed node a value of mvt aim/gram (“movement aim”). The computer was acting on its 
faulty “knowledge” that “the trees” could not listen but could be a goal of movement. The 
consistency checker subsequently detected and corrected this error. 


(4.8) [Phrase marker for Judg 9:10; the tree layout and the solid and dotted boxes did not survive 
extraction. Recoverable constituents of the main clause (cl/oblq), in text order: ‘and’; vb/gram 
‘they(m) said’; sbj/gram dominating n:veg/def (‘the’ + ‘trees’); ind obj dominating to+veg 
(preposition + n:veg/def (‘the’ + ‘fig’)); obj addr/gram dominating sent/juxt, which joins 
cl/oblq (vb/gram ‘thou(f) walk’; sbj/gram ‘thou(f)’) and cl/oblq (vb/gram ‘thou(f) reign’; 
ruled/gram dominating upon+pron/prep (‘upon’ + pronoun)).] 


4.5 Exercise: A Test Case Phrase Marker 


Let us take the phrase-marker concepts out for a bit of exercise. For phrase marker (4.9) from 
Prov 3:9, what are the answers to these questions? (Answers are supplied below the phrase marker.) 


1. How many nodes are there? 

2. Without further counting, how many edges are there? 
3. Which is the root node? 

4. What are the root node's in-degree and its out-degree? 
5. What is the depth of the tree? 
6. What is the length of the phrase marker? 
7. How many children does the from+val/union/disj node have? 
8. What node immediately precedes כל? 
9. What node(s) dominate the segment כל? 

(4.9) [Phrase marker for Prov 3:9, given as an indented outline because the tree layout did not 
survive extraction; each step of indentation is one generation, and glosses stand for the segments.] 

cl/oblq 
    vb/gram 
        ‘thou(m) honor’ 
    dir obj/gram 
        obj mk+deity/prep 
            [nota acc.] 
            ‘Yahweh’ 
    rsrc/gram 
        from+val/union/disj 
            from+val/prep 
                ‘from’ 
                n:val/suff 
                    ‘wealth’ 
                    ‘thee(m)’ 
            ‘and’ 
            from+time/prep 
                ‘from’ 
                n:time/cstr 
                    ‘beginning of’ 
                    n:veg/cstr 
                        ‘all of’ 
                        n:veg/suff 
                            ‘yield’ 
                            ‘thee(m)’ 


Answer #1: 24 nodes. Answer #2: One less than the number of nodes, namely 23 edges. Answer #3: The node 
labeled cl/oblq. Answer #4: In-degree = 0; out-degree = 3. Answer #5: depth = 8. Answer #6: length = 12. 
Answer #7: 3 children. Answer #8: ראשית. Answer #9: There are 6 dominating nodes: n:veg/cstr, n:time/cstr, 
from+time/prep, from+val/union/disj, rsrc/gram, and cl/oblq. 
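
The answers can also be checked by machine. The sketch below encodes phrase marker (4.9) as nested tuples, using glosses in place of the Hebrew segments. The encoding, the helper names, and the labels n:val/suff and n:veg/suff (read off the damaged diagram) are assumptions of this illustration.

```python
def node(label, *children):
    """A node is a (label, children) pair; a leaf has no children."""
    return (label, list(children))


wealth = node("n:val/suff", node("wealth"), node("thee(m)"))
yield_np = node("n:veg/suff", node("yield"), node("thee(m)"))
all_of = node("all of")
veg = node("n:veg/cstr", all_of, yield_np)
time_np = node("n:time/cstr", node("beginning of"), veg)
union = node("from+val/union/disj",
             node("from+val/prep", node("from"), wealth),
             node("and"),
             node("from+time/prep", node("from"), time_np))
root = node("cl/oblq",
            node("vb/gram", node("thou(m) honor")),
            node("dir obj/gram",
                 node("obj mk+deity/prep", node("[nota acc.]"), node("Yahweh"))),
            node("rsrc/gram", union))


def count_nodes(n):
    return 1 + sum(count_nodes(c) for c in n[1])


def depth(n):
    return 1 + max((depth(c) for c in n[1]), default=0)


def length(n):
    return 1 if not n[1] else sum(length(c) for c in n[1])


def dominators(n, target):
    """Labels of the nodes dominating `target`, nearest first; None if absent."""
    for c in n[1]:
        if c is target:
            return [n[0]]
        below = dominators(c, target)
        if below is not None:
            return below + [n[0]]
    return None
```

Calling `dominators(root, all_of)` walks up from the segment glossed ‘all of’ and returns the six labels listed in Answer #9, in the same order.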


4.6 Brief Summary 


Vocabulary of Trees. We introduce the vocabulary for describing phrase marker trees: node, 
edge, root, leaf, mother, daughter, sister, in-degree, out-degree, dominance, and precedence. 

Trees versus Graphs. Technically, the vast majority of our phrase markers are labeled, directed 
trees. We explain the conditions that these sorts of structure obey: single-root, single-mother, non- 
tangling, and exclusivity. 

Information Propagation and Sharing. A set of conventions governs the propagation of in- 
formation, especially semantic information, within our phrase markers. The conventions include a 
basic recursive rule and overrides for handling definite phrases, inverted modification constructions, 
and construct chains. 

The Phrase Marker Creation Process. The phrase markers were initially created by a battery 
of incremental computational parsers. Corrections, extensions, and consistency enforcement are the 
ongoing work of human over-readers. 


Chapter 5 


The Basic Phrase Types of 
Biblical Hebrew 


5.1 The Constituent Hierarchy of Biblical Hebrew 


Before studying phrases, we will examine where phrases fit in the constituent hierarchy of Bib- 
lical Hebrew. Each level of the hierarchy typically is built up from constituents in the next lower 
level. We say “typically” because higher-level structures (such as clauses, sentences, and other 
kinds of discourse units) can be contained (“embedded”) within lower-level structures.¹ 

Our basic constituent hierarchy has five levels. These are most easily understood by considering 
an example. In grid-overlain phrase marker (5.1) for Gen 3:21, the columns of the superimposed 
grid identify the five levels making up the phrase marker. From right to left, these are: segment, 
phrase, CIC (clause immediate constituent), clausal, and supra-clausal. 

The “lowest” (right-most) level of the phrase marker hierarchy consists of the text segments. 
By now, readers have a good grasp of what a segment is, but a bit of repetition should not be amiss: 
a segment is a word, part of a word, or a sequence of words that is an ultimate constituent in our 
syntactic analysis. Segments are the “atoms” of our analyses. Single segments can themselves be 
phrases or can combine to form more complicated phrases. 


5.2 Basic and Complex Phrases 


This chapter introduces the basic phrase types of Biblical Hebrew. A phrase is basic if it contains 
neither phrase type(s) nor higher-level constituents.² Equivalently, a basic phrase contains only one 
licensing relation. A phrase is complex if it contains phrase type(s) or higher-level constituent(s), in 
which case it must involve more than one licensing relation. 

Consider an example: אם יעקב “mother of Jacob” is a basic phrase, a construct phrase. 
יעקב ועשו “Jacob and Esau” is another type of basic phrase, a union phrase in our parlance. But 
אם יעקב ועשו “mother of Jacob and Esau” is a complex phrase, since it consists of a union phrase 
contained in a construct phrase as its nomen rectum. Phrase marker (5.2) shows this complex phrase 
from Gen 28:5. 


1. Embedding is the focus of chap. 8. 
2. Supra-clausal constituents will be dealt with in chap. 21. The higher-level constituents consist of CICs (clause 
immediate constituents) as well as clausal and supra-clausal constructions. 
(5.1) [Grid-overlain phrase marker for Gen 3:21; the tree layout did not survive extraction. 
Column headings, left to right: Supra-Clausal | Clausal | CIC | Phrase: Tightly Joined, Simple 
(Unconjoined), and Conjoined | Segment. Recoverable constituents of the first clause (cl/oblq), in 
text order: ‘and’; vb/gram ‘he made’; sbj/gram dominating n:deity/app (‘Yahweh’ + ‘God’); 
benf/gram dominating to+humn/union/disj, which conjoins to+humn/prep (‘to’ + ‘Adam’), ‘and’, 
and to+humn/prep (‘to’ + n:humn/suff (‘woman’ + ‘his/him’)); dir obj/gram dominating a construct 
phrase (‘tunics of’ + ‘skin’). A second cl/oblq with its own dir obj is only partly legible.] 

(5.2) [n:humn/cstr: ‘mother of’ + n:humn/union/disj (‘Jacob’ + ‘and’ + ‘Esau’)] 


Note that the n:humn / cstr node dominates the n:humn / union/disj node and therefore has included 
within itself the union construction. 
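
The distinction between basic and complex phrases amounts to counting licensing relations. The sketch below is illustrative only: the tuple encoding `(relation, parts)` and the names `relations` and `is_basic` are assumptions, and it ignores embedded higher-level constituents such as clauses.

```python
def relations(phrase):
    """All licensing relations found in a phrase, outermost first.

    A phrase is a (relation, parts) pair; each part is either a segment
    (a plain string) or another phrase.
    """
    relation, parts = phrase
    found = [relation]
    for part in parts:
        if not isinstance(part, str):
            found.extend(relations(part))
    return found


def is_basic(phrase):
    """A basic phrase involves exactly one licensing relation."""
    return len(relations(phrase)) == 1


mother_of_jacob = ("cstr", ["mother of", "Jacob"])
jacob_and_esau = ("union/disj", ["Jacob", "and", "Esau"])
mother_of_both = ("cstr", ["mother of", jacob_and_esau])
```

Here `mother_of_both` is complex because its construct relation contains a union relation, exactly the situation shown in (5.2).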
Phrase marker (5.1) has four basic phrases: n:deity/app, the upper to+humn/prep, n:humn/suff, 
and n:garm/cstr. It contains two complex phrases: to+humn/union/disj and the lower to+humn/prep. 
The phrase marker is tidy. The “higher” constituents contain the “lower.” 

We have not yet described the various phrase types and their composition. The basic phrases are 
straightforward. In this chapter, we describe them. In chap. 6, we take up complex phrases. 

We work with three kinds of basic phrases: 


1. tightly joined, 
2. simple (unconjoined), 
3. conjoined. 


These phrase types are uniquely defined by the sets of relations that license their formation. 


5.3 Basic Tightly Joined Phrases 


A phrase that is declared to be “tightly joined” is licensed either by the relation of suffixation 
(suff) or by definiteness (def). Both relations are highly local and hence are referred to as “tight.” 
Phrase marker (5.3) for Gen 42:20 contains both types of tight phrase.³ Nodes licensed by the suff 
and def relations are usually right-shifted so as to be close to their children segments. 


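(5.3) [Phrase marker for Gen 42:20; tree layout lost in extraction. Recoverable structure of 
cl/oblq, in text order: dir obj/gram dominating obj mk+humn/prep ([nota acc.] + n:humn/app, which 
joins n:humn/suff (‘brother’ + ‘you(m)’) with n:qual/def (‘the’ + ‘little’)); vb/gram 
‘do you(m) bring’; mvt aim/gram dominating to+pron/prep (preposition + ‘me’).] 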


The significance of the structures and labels making up the phrase marker is explained below. 


5.3.1 Tight Pronominal Suffixes 

Our Kethiv text of Biblical Hebrew contains 45,025 pronoun suffixes. More than half of these 
(55%, or 24,894) are in tight phrases.⁴ The table shows the distribution of these pronominal suffixes 
in terms of the various parts of speech to which they attach. 


3. Note that what is in apposition in (5.3) is not the “adjective” ‘little’ but a definite nominal; ‘the little’ could have 
been called a definite adjective. 
4. We use “tight” rather than “bound” because the meaning of “bound” is often extended in linguistic theory. 
Tightly Suffixed Part of Speech | Percentage 
Common Noun | 95.6% 
Pure Noun Participle | 3.0% 
“All” כל־ | 0.9% 
Cardinal / Ordinal Number | 0.5% 


One exception aside,⁵ each of the pronominal suffixes that is a constituent in a tight phrase 
combines only with its preceding segment, making this pronominal suffix relationship highly local. 

The pronominal suffix in a tight phrase functions as a “possessive” pronoun or akin to one.⁶ For 
example, in the phrase marker above, the n:humn/suff node dominates the noun אחי and the suffix 
־כם, yielding a constituent that may be translated ‘your [masc. pl.] brother’. 


5.3.2 Determined Segments 

Our Kethiv text of Biblical Hebrew contains 30,298 definite articles. A prefixed article segment 
usually combines immediately and only with a single following segment. This is the situation in 
the vast majority of cases, 29,166 times (96.3%). In these cases, the relationships are local—that 
is, tight. The remaining 3.7% of the definite articles (1,132 in all) are constituents in phrases that, 
strictly speaking, are not tight. These constructions are complex. Their exposition is given in chap. 6. 


5.4 Basic Unconjoined Phrases 


Included among the basic unconjoined phrases are all manner of prepositional and nominal 
phrases that involve neither tight suffixation nor definiteness nor explicit conjoining via conjunc- 
tions nor implicit conjoining via juxtaposition. We identify four basic unconjoined phrase types: 


1. prepositional phrases, 
2. construct phrases, 

3. simple modification, 
4. inverted modification. 


The first two phrase types are consistent with traditional analyses and will be dealt with only briefly. 
The final two may be less well known and thus will be treated in greater detail. 


5.4.1 Prepositional Phrases 

A basic prepositional phrase is licensed by the relation of preposition (prep), and itself con- 
tains no basic phrase types or higher-level constituents. Of the 74,058 prepositions in Biblical 
Hebrew, 24,973 (34%) are part of basic prepositional phrases.⁹ The prepositions in basic 
prepositional phrases combine with sibling segments of the sorts indicated in the table. 


5. The exception is in Ps 89:3. This fascinating verse is discussed at the end of §20.2.3.4, phrase marker (20.17). 

6. We write “akin to one,” because in a tight phrase such as מלכי ‘my king’, the speaker is not claiming to own the 
king. 

7. We recognize two other unconjoined phrase types: nominalized clauses (§8.1) and distributed apposition 
(§20.2.2). 

8. This total includes the direct object markers. 

9. Almost 2,900 are segments in compound prepositions, the creation of which is licensed by the juxt relation. See 
§5.5.3. 
Sibling of Preposition | Representative Gloss | Percentage 
Pronoun Suffix | “to him” | 51.2% 
Noun | “to Adam” | 43.8% 
Adverbial | “from there” | [illegible] 
Demonstrative Pronoun | “like these” | [illegible] 
Pure Noun Participle | “to captured-ones” | [illegible] 
Other | “until them,” “to become-many” | 0.1% 


The constituent from Job 34:31 in (5.4) illustrates the appearance of a basic prepositional phrase. 
The constituent consists of the preposition אל ‘unto’ (to) followed by (+) the proper noun אל ‘God’ 
(deity). The construction is licensed by the relation preposition (prep). 


(5.4) [to+deity/prep: ‘unto’ + ‘God’] 


5.4.2 Construct Phrases 

A basic construct phrase is licensed by the relation of construct (cstr) and itself contains no 
basic phrase types or higher-level constituents. A construct relation licenses the assembly of a con- 
struct construction (or *chain"). A noun in the construct state always forms a construct phrase by 
pairing with a following noun, noun phrase, or nominalized constituent. 

Of the 34,384 construct state segments in Biblical Hebrew, 16,925 (49%) are the lead-items 
(nomina regens)¹⁰ in basic construct phrases. The table shows the parts of speech of the nomina 
regens in the basic construct phrases. 


Part of Speech of Nomina Regens | Representative Gloss | Percentage 
Common Noun | “sons of [Noah]” | 93.2% 
“all” | “all of [unclean thing]” | [illegible] 
Proper Noun | “Ur of [Chaldeans]” | [illegible] 


Phrase marker (5.5) from Neh 9:7 has אור כשדים “Ur of Chaldeans” (two proper nouns!). Licensed 
by the relation of construct (cstr), a basic construct noun phrase having geographic semantics 
(n:geog) is shown. 


(5.5) [n:geog/cstr: ‘Ur of’ + ‘Chaldeans’] 


10. Nomina regens = ‘reigning nouns’. 
5.4.3 Simple Modification 

A basic modification phrase is licensed by the relation of modification (mod), and itself contains 
no basic phrase types or higher-level constituents. The modification relation licenses a construction 
where the first constituent is characterized more precisely by the second. The second constituent 
adds information. Example: “the water under the earth.” 

Of the 4,086 modification constructions in Biblical Hebrew, the vast majority involve prepo- 
sitional phrases, and these will be taken up in chap. 6. Only 442 (11%) are in basic modification 
phrases. In a basic modification phrase, the second constituent may be an adverbial (as in “big 
very”), a numeral (as in “cows six"), or a common noun (as in “filled [with] incense”). 

Phrase marker (5.6) from 1 Sam 1:24 showing פרים שלשה ‘bulls three’ involves a basic 
modification noun phrase with animal semantics (n:anml), its assembly being licensed by the 
modification relation (mod). 


(5.6) [n:anml/mod: ‘bulls’ + ‘three’] 


5.4.4 Inverted Modification 

A basic inverted modification phrase is licensed by the relation of inverted modification (inv 
mod) and contains no basic phrase types or higher-level constituents. The inverted modification 
relation licenses a construction in which the second constituent is characterized more precisely by 
the first. The first constituent adds information about the second. Example: “six cows.” 

Of the 3,537 inverted modification constructions in Biblical Hebrew, 2,138 (60%) are in basic 
inverted modification phrases. The table provides the census of the main inverted modifiers. 


Inverted Modifier Part of Speech | Representative Gloss | Percentage 
Cardinal Number | “thirty [years]” | 42.3% 
Quasiverbal | “behold [I gave]” | 34.6% 
“also” גם | “also [I]” | 10.8% 
Negative | “not [good]” | 7.2% 
Other | | 5.1% 


This table involves very disparate kinds of constituents, lumped together by being licensed by the 
inverted modification relation. (Quasiverbals are thoroughly dealt with in chap. 18. Discontinuous 
inverted modification involving quasiverbals is taken up toward the end of §20.1.2.) 

Phrase marker (5.7) from Gen 11:18 has a basic inverted modifier phrase involving the cardinal 
number ‘thirty’. 
(5.7) [Phrase marker for Gen 11:18; tree layout lost in extraction. Recoverable constituents of 
cl/oblq, in text order: ‘and’; ‘he lived’; ‘Peleg’; tm int/gram dominating n:time/inv mod 
(‘thirty’ + ‘year’).] 


Or, consider phrase marker (5.8) from Gen 27:31. It contains a basic inverted modification phrase, 
a clause-internal instance of inverted modification by גם. 


(5.8) [Phrase marker for Gen 27:31; tree layout and node labels lost in extraction. Segment glosses, 
in text order: ‘and’, ‘he made’, ‘also’, ‘he’, ‘delicacies’.] 


5.5 Basic Structurally Defined Conjoined Phrases 


Conjoined phrases are created by joining constituents via a particular set of licensing relations. 
We distinguish two subtypes of conjoined phrases: phrases the assembling of which is based on 
structural criteria, and phrases the assembling of which is based on semantic criteria. As we will 
see in §5.6, the semantically defined phrases involve refined specification of the structurally defined 
phrases. 

We distinguish three types of structural conjoining: 


1. union (“and”) or disjoint (“or”) phrases (union/disj), 
2. gathered (mixed list) phrases (mix list), 
3. joined phrases (juxt). 


5.5.1 Union or Disjoint Phrases 

Often called a coordinate phrase, a union (or disjoint) phrase consists of a series of 
constituents, connected by coordinating conjunctions: “X and Y and Z” (or disjunctive conjunctions: 
“X or Y or Z”). Across Biblical Hebrew, the union relation is used 14,633 times, most of them 
resulting in complex coordinate phrases. In addition to its principal use in licensing the assembling 
of coordinate phrases (9,511 times), the union relation also licenses the combining of clauses into 
sentences (5,049 times).¹¹ We count 2,293 basic (single licensing relation) coordinate phrases in 
Biblical Hebrew. Most (2,052 or 89%) contain only two conjuncts.¹² Phrase marker (5.9) for a basic 
coordinate phrase (licensed by union/disj) consists of a list of names of humans and is taken from 
Neh 12:42. The resulting noun phrase has human semantics. 


(5.9) [n:humn/union/disj: ‘Maseyah’, ‘and’, ‘Shemayah’, ‘and’, ‘Elazar’, ‘and’, ‘Uzzi’, ‘and’, 
‘Yehohanan’, ‘and’, ‘Malkiyyah’, ‘and’, ‘Elam’, ‘and’, ‘Ezer’] 


11. Very rarely, there is compounding of clause adjuncts (ca. 30 times). On adjuncts, see §7.3.1. 
12. The longest basic conjoined phrase contains 14 city names (for 13 cities) in a list in Josh 19:2-6. 


5.5.2 Gathered (Mixed List) Phrases 


(5.10) [n:humn/mix list: ‘Shem’, ‘Ham’, ‘and’, ‘Yapeth’] 


If one deletes some (but not all) of the conjunctions in a coordinate phrase, then the resulting 
construction is a gathered phrase or a mixed list. This specification entails that a basic mixed list 
must contain at least three conjuncts plus at least one coordinating conjunction. The licensing rela- 
tion for a mixed list is mix list. Phrase marker (5.10) shows a basic mixed list of names of humans 
from Gen 10:1. These names appear elsewhere as basic coordination phrases (union/disj) in Gen 
7:13 and 9:18. 

We invoke this licensing relation 930 times across Biblical Hebrew. In 30% of its appearances, 
the relation gathers clauses together to form sentences. In 20%, the mixed lists consist of human 
names. The relation occurs in basic mixed lists 157 times and in non-tree phrase markers 27 times. 




5.5.3 Joined Phrases 


If one deletes all of the conjunctions in a coordinate phrase, then the resulting construction is 
a joined phrase or, more loosely, a juxtaposed phrase (juxt). This specification entails that a joined 
phrase contain at least two “conjuncts” and no coordinating conjunctions. Phrase marker (5.11) 
from Ezra 8:24 shows a basic two-segment juxtaposed phrase. The syntax of the clause of which 
the two segments are a part is quite complicated, but it is clear that they are part of a single basic 
joined phrase. 
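
The three structural types differ only in how many of the gaps between adjacent conjuncts are filled by a conjunction. The sketch below makes that explicit; the function name `classify` and the list-of-booleans encoding are assumptions of the illustration, and the semantic refinements of §5.6 are left out.

```python
def classify(gaps):
    """Pick the structural licensing relation for a conjoined phrase.

    `gaps[i]` is True when a coordinating conjunction stands between
    conjunct i and conjunct i + 1, so a phrase with n conjuncts has
    n - 1 gaps.
    """
    if not gaps:
        raise ValueError("a conjoined phrase needs at least two conjuncts")
    if all(gaps):
        return "union/disj"   # every conjunction present
    if any(gaps):
        return "mix list"     # some, but not all, conjunctions present
    return "juxt"             # no conjunctions at all


shem_ham_and_yapeth = classify([False, True])      # as in (5.10)
jacob_and_esau = classify([True])                  # as in (5.2)
sherebyah_hashabyah = classify([False])            # as in (5.11)
```

Since "some but not all" requires at least two gaps, the sketch also shows why a mixed list must contain at least three conjuncts.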


(5.11) [n:humn/juxt: ‘Sherebyah’ + ‘Hashabyah’] 


Joined phrases also occur when prepositions or parts of numerals are immediately adjacent. Phrase 
marker (5.12) from Ezek 10:2 contains a pair of triple-preposition constructions. 


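(5.12) [Phrase marker for Ezek 10:2; tree layout lost in extraction. Recoverable structure of 
cl/oblq, in text order: vb/gram ‘thou(m) come’; mvt aim/gram dominating to+utnsl/prep, which 
combines a juxt phrase of three prepositions (‘unto’ + ‘between’ + ‘to’) with n:utnsl/def 
(‘the’ + ‘wheel’); mvt aim/gram dominating to+deity/prep, which combines a juxt phrase of three 
prepositions (‘unto’ + ‘underneath’ + ‘to’) with n:deity/def (‘the’ + ‘cherub’).] 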


All told, there are 5,015 juxt relations in Biblical Hebrew. Of these, 1,833 appear as licensing rela- 
tions in the formation of sentences, leaving about 3,200 in phrasal constructions. 138 of these oc- 
cur in non-tree phrase markers. The 3,200 constructions are divvied up between basic and complex 
joined phrases. Over two-fifths (1,440) appear in joined prepositions, of which the most common 
first preposition (around 63%) is מ־ or other forms of ‘from’. There are 1,367 basic joined phrases 
that consist of two juxtaposed prepositions, 45 basic joined phrases that consist of three juxtaposed 
prepositions, and two that consist of four juxtaposed prepositions (1 Sam 7:11 ‘until from under 
to’ and 1 Kgs 7:32 ‘to from under to’). The combining of cardinal numbers is often licensed by 
juxt. There is one basic joined phrase having four adjacent numerals (Neh 7:11), 42 having three 
adjacent numerals, and 577 having two. The remaining joined phrases are a variety of other noun 
phrases. 


5.6 Basic Semantically Defined Conjoined Phrases 


The pair of basic conjoined noun phrases in (5.13) and (5.14) is drawn from Gen 2:21 and 
Exod 3:4. 


(5.13) [n:deity/app: ‘Yahweh’ + ‘God’] 


(5.14) [n:humn/echo: ‘Moses’ + ‘Moses’] 


Examination of the texts from which these constructions are extracted discloses that a proper pars- 
ing of each requires that the pairs of segments combine to form basic two-segment phrases. The 
structural relation of juxtaposition (juxt) would be a valid licensing relation for each phrase. The 
segments are, after all, juxtaposed in the text. However, we have elected to enhance our representa- 
tion by introducing five specialized licensing relations. Two of these can appear in basic phrases: 
echo (echo) and apposition (app). While juxtaposition seemingly requires only adjacency,¹⁴ 
apposition requires identical reference and echo requires identical form, function, and reference.¹⁵ In 
the next two subsections, we will examine these two types of basic semantically defined conjoined 
phrases. 
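
The choice among the three relations for a two-segment phrase can be sketched as a short decision function. This is a simplification under stated assumptions: the function name and its three yes/no inputs are hypothetical, and the sketch presupposes that the two segments are adjacent and belong to the same clause.

```python
def licensing_relation(same_form, same_function, same_reference):
    """Choose the most specific relation for two adjacent segments in one clause."""
    if same_form and same_function and same_reference:
        return "echo"   # exact repetition, as in 'Moses Moses' (5.14)
    if same_reference:
        return "app"    # same referent, different form, as in 'Yahweh God' (5.13)
    return "juxt"       # mere adjacency


moses_moses = licensing_relation(True, True, True)
yahweh_god = licensing_relation(False, True, True)
sherebyah_hashabyah = licensing_relation(False, True, False)
```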


5.6.1 Echo Phrases 


“Echo” refers to the exact repetition of some constituent within a clause in such a way that all 
of the repeated constituents combine to form an echo construction. The morphologically identical 
constituents are usually adjacent, but adjacency is not sufficient since adjacent morphologically 
identical segments may be in different clauses and/or may exercise different functions in their 
clause(s). For example, consider (5.15) showing two consecutive clauses from Qoh 7:3-4. 


13. The other three types of semantically defined conjoined phrases are always complex: strictly subsetting, strictly 
supersetting, and nested. These types are taken up in chap. 6. 

14. The licensing relation of juxtaposition is rarely invoked when the joined constituents are not adjacent. Nonad- 
jacency occurs in less than two percent of the cases licensed by juxt. 

15. Strictly speaking, two successive segments might be homographs having the same form but differing functions. 
In such a case, the items would not combine to form a single conjoined constituent. We know of no actual case of this 
phenomenon in the Hebrew Bible, and so an example from English must suffice: “I saw the duck duck.” 
(5.15) [Phrase marker for two consecutive clauses in Qoh 7:3-4. Glosses in text order, first clause: ‘in’, ‘evil of’, ‘face’, ‘he will be good’, ‘heart’; second clause: ‘heart of’, ‘wise’, ‘in’, ‘house of’, ‘funeral’.]


The sequence לב לב (‘heart’ followed by ‘heart of’) spans the boundary between the clauses. The first לב is in the normal state,
while the second is in the construct state. They cannot form an echo phrase for two reasons: They 
span a clause boundary, and they have differing functions. Thus we see, perhaps surprisingly, that 
even the recognition of echo phrases can require a considerable amount of analysis. 
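
The two tests just applied to Qoh 7:3-4 can be restated as a small check. The following Python sketch is illustrative only: the `Segment` record, its field names, the transliterated forms, and the function labels are assumptions made for the example and are not the markup of our database. Identity of reference, which echo also requires, is not modeled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """Hypothetical record for one text segment (not the database schema)."""
    form: str       # surface form, transliterated for the example
    function: str   # grammatical function or state label (illustrative)
    clause_id: int  # identifier of the clause that contains the segment


def can_form_echo(first: Segment, second: Segment) -> bool:
    """Return True when two adjacent segments meet the formal echo conditions.

    Adjacency is assumed to have been established by the caller. The check
    requires identical form, identical function, and a shared clause.
    """
    return (
        first.form == second.form
        and first.function == second.function
        and first.clause_id == second.clause_id
    )


# Qoh 7:3-4: 'heart' closes one clause; 'heart of' opens the next.
heart = Segment(form="lb", function="sbj", clause_id=1)
heart_of = Segment(form="lb", function="cstr", clause_id=2)

# Gen 7:19: 'very very' inside a single clause.
very_1 = Segment(form="m'd", function="mnr", clause_id=1)
very_2 = Segment(form="m'd", function="mnr", clause_id=1)

print(can_form_echo(heart, heart_of))  # False
print(can_form_echo(very_1, very_2))   # True
```

The Qoh 7:3-4 pair fails on both counts (different clause, different function), whereas the Gen 7:19 pair passes.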

We find 229 echo phrases in Biblical Hebrew. There are 161 basic echo phrases and 68 complex 
echo phrases. There are 18 that require non-trees for their representation. 

Most echo phrases contain two sisters (218 instances, 95.2%). Among the basic two-sister echo
phrases, we find:


* סביב ‘around’ echoed 27 times; see, for example, phrase marker (5.16) from Ezek 37:2:

(5.16) [Phrase marker for Ezek 37:2. Glosses in text order: ‘and’, ‘he caused to cross’, ‘me’, ‘upon’, ‘them(m)’, ‘around’, ‘around’. The two ‘around’ segments form the echo phrase (gram / echo).]


* איש ‘man’ echoed 16 times; see, for example, phrase marker (5.17) from Num 1:4:

(5.17) [Phrase marker for Num 1:4. Legible glosses in text order: ‘and’, ‘with’, ‘they(m) will be’, ‘man’, ‘man’, ‘to’, ‘the’, ‘tribe’. The two ‘man’ segments form the subject echo phrase (n:humn / echo).]


* Numerals echoed 14 times; see, for example, phrase marker (5.18) from Ezek 10:21:

(5.18) [Phrase marker for Ezek 10:21. Legible glosses in text order: ‘four’, ‘four’, ‘face’, ‘to’. The two ‘four’ segments form an echo phrase inside an inverted modification phrase.]


* מאד ‘very’ echoed 12 times; see phrase marker (5.19) from Gen 7:19:

(5.19) [Phrase marker for Gen 7:19. Glosses in text order: ‘and’, ‘the’, ‘water’, ‘they excelled’, ‘very’, ‘very’, ‘upon’, ‘the’, ‘land’. The two ‘very’ segments form the manner echo phrase (mnr / echo).]


There are four echo phrases having three sisters: Isa 6:3 (‘holy, holy, holy’), Jer 7:4 (‘Temple of
Yahweh, Temple of Yahweh, Temple of Yahweh’, a complex echo phrase), Jer 22:29 (‘land, land,
land’), and Ezek 21:32 (‘ruin, ruin, ruin’). Phrase marker (5.20) shows Ezek 21:32a:


(5.20) [Phrase marker for Ezek 21:32a. Glosses in text order: ‘ruin’, ‘ruin’, ‘ruin’, ‘I shall put’, ‘her’. The three ‘ruin’ segments form the object complement (obj cmp / gram), labeled n:bldg / echo.]


The object complement (obj cmp/gram)¹⁶ in this clause is a basic triple-echo phrase, a noun phrase
with building semantics (n:bldg/echo).¹⁷


5.6.2 Apposition Phrases 

An apposition phrase is a construction wherein two or more constituents have an identical refer- 
ence. An apposition phrase is basic if its constituents are all segments. There are 17,961 apposition 
phrases in Biblical Hebrew. Of these, 676 require non-tree representation. There are 2,207 basic 
two-segment apposition phrases and 19 basic three-segment apposition phrases. Thus, we see that 
the vast majority of apposition phrases are complex. 
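
The size of that majority follows directly from the counts above. The short Python sketch below works the arithmetic; it assumes that the two-segment and three-segment categories just cited are the only basic apposition phrases, which the text implies but does not state outright.

```python
# Counts quoted in the text for apposition phrases in Biblical Hebrew.
total_apposition = 17_961
basic_two_segment = 2_207
basic_three_segment = 19

# Assumption: the two categories above exhaust the basic apposition phrases.
basic = basic_two_segment + basic_three_segment
complex_count = total_apposition - basic
complex_share = complex_count / total_apposition

print(basic)                   # 2226
print(complex_count)           # 15735
print(f"{complex_share:.1%}")  # 87.6%
```

On that assumption, roughly seven of every eight apposition phrases are complex.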

Phrase marker fragment (5.21) from Jer 5:8 is a basic three-segment apposition phrase: 


(5.21) [Phrase marker: n:anml / app, with segments glossed ‘horses’, ‘lustful ones’, and ‘lusty ones’.]


5.7 Brief Summary 


The Constituent Hierarchy. Our constituent hierarchy has these five levels (each taken up in
the chapters indicated):

supra-clausal (chap. 21)
clause (chaps. 7-8)
clause immediate constituent (chaps. 9-16)
phrase (chaps. 5-6)
segment (chap. 3)

We define two types of phrases. A basic phrase involves only one licensing relation, while a com- 
plex phrase involves more than one licensing relation. This chapter provides a census, with ex- 
ample phrase markers, of the various kinds of basic phrases attested in Biblical Hebrew. 

Basic Tight Phrases. Basic tight phrases consist of a pronoun suffix or a definite article plus a 
single-segment substantive. Overwhelmingly, the substantive is a common noun. 


16. The identification of most clause immediate constituents is licensed by the nonspecific relation gram
“grammar determined.”
17. That is, the noun has the semantics of a “building.” 
Basic Unconjoined Phrases. Basic unconjoined phrases involve neither suffixation nor definiteness
nor explicit joining via conjunctions nor implicit joining via juxtaposition. Basic unconjoined
phrases are of four sorts: prepositional phrases (“to David”), construct phrases (“sons-of
Noah”), simple modification phrases (“bulls three”), and inverted modification phrases (“three
bulls”).

Basic Structurally Defined Conjoined Phrases. Basic structurally defined conjoined phrases
are of three sorts: union phrases (“Shem and Ham and Yapeth”), mixed lists (“Shem, Ham, and
Yapeth”), and juxtaposed lists (“Shem, Ham, Yapeth”).

Basic Semantically Defined Conjoined Phrases. Basic semantically defined conjoined phrases
are of two sorts: echo phrases (“ruin, ruin, ruin”) and apposition phrases (“man Canaanite”).


Chapter 6 


Complex Phrases in 
Biblical Hebrew 


6.1 Embedding 


Language allows, indeed relies upon, abundant embedding, whereby “higher” constituents are
contained in “lower” constituents.¹ Consider phrase marker (6.1), which shows the syntactic
structure of part of the direct object of the main clause in Deut 29:25. Reading diagonally down the
left-hand edge of the phrase marker, we see that we are dealing with a noun phrase, a “lower”
constituent, having divine semantics (n:deity). We also see that the first segment (אלהים) joins with
the nominalized construction (nom / nom) because the latter is in apposition (app) to the former.²
The construction that is explicitly nominalized by אשר is a sentence (sent), a “higher” constituent,
formed by conjoining two clauses (both cl/oblq) by “and,” licensed by the union (or disjunction)
relation (union/disj).³ The details need not detain us. The point is that this sort of construction,
wherein clausal and supra-clausal (that is, “higher” than the clause) constituents are contained
within otherwise straightforward noun phrases or other constituents, is rather common (see (6.1)).

If readers who have an appetite for bizarre examples and who have access to a rendition of our
database bring up the Andersen-Forbes phrase marker analysis and have a look at Jer 11:1-5a, they
will encounter a main clause that we believe is the champion for iterated embedding of sentences,
clauses, and infinitives construct.⁴ A suitable path begins in Jer 11:1, where the reader encounters
14 embedded predicator structures:


11:1 {clause {clause (inf. constr. 
11:2 {sent {sent 
11:3 {clause {clause {clause {clause 
11:4 {clause (inf. constr. (sent {clause {clause 




1. Our five-level hierarchy of constituents is shown and discussed at the beginning of §5.1. In phrase marker (5.1),
higher-level constituents are to the left of lower-level constituents.

2. We discuss the nominalized clause construction (nom / nom) in §8.1.

3. In this phrase marker, we encounter dotted edges. They signal that the supra-clausal construction of which they 
are part is an interim makeshift that we hope to replace by a proper discourse-analytic construction one day. 

4. We thank Eli Evans of Logos Research Systems for searching out this monumental syntactic structure for us. 
(6.1) [Phrase marker for part of the direct object in Deut 29:25. Glosses in text order: ‘gods’, ‘which’, ‘not’, ‘they knew’, ‘them(m)’, ‘and’, ‘not’, ‘he divided’, ‘to’, ‘them(m)’. The noun ‘gods’ stands in apposition to a nominalized sentence (sent, union/disj) built from two clauses (cl / oblq).]


6.2 Complex Phrases 


In our parsing of Biblical Hebrew, there are almost 2,000 distinct node labels corresponding to 
almost 2,000 different constituents. To describe every distinct constituent would be quite beyond 
the scope of the present volume. The constituents may, however, all be searched for and studied 
using our database. Our goal in this chapter is rather modest: simply to describe and provide ex- 
amples for each major complex phrase type. 

We simplify terminology and classify the constituents making up a construction as segments
and/or phrases.⁵ Given this simplified definition, complex phrases divide into four configurations:


1. Segment-segment. A segment and following segment(s) combine to form a basic phrase 
(covered in chap. 5). 

2. Segment-phrase. A segment and following phrase combine to form a complex phrase. 

3. Phrase-phrase. A phrase and following phrase combine to form a complex phrase. 

4. Phrase-segment. A phrase and a following segment combine to form a complex phrase. 
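
The four configurations amount to a two-way classification of the leading and the following constituent. A minimal Python sketch, assuming a toy representation in which a segment is a plain string (its gloss) and a phrase is a tuple of constituents; this representation is an illustration, not the format of our database.

```python
from typing import Union

# Toy representation (an assumption for this sketch): a segment is a gloss
# string, and a phrase is a tuple of constituents.
Constituent = Union[str, tuple]


def kind(constituent: Constituent) -> str:
    """Classify a constituent as 'segment' or 'phrase'."""
    return "phrase" if isinstance(constituent, tuple) else "segment"


def configuration(first: Constituent, following: Constituent) -> str:
    """Name the configuration formed by a constituent and what follows it."""
    return f"{kind(first)}-{kind(following)}"


def is_complex(first: Constituent, following: Constituent) -> bool:
    """Only the segment-segment configuration yields a basic phrase."""
    return configuration(first, following) != "segment-segment"


# Glosses only: a segment followed by a union phrase.
union_phrase = ("me", "and", ("flesh", "me"))
print(configuration("violence", union_phrase))  # segment-phrase
print(is_complex("violence", union_phrase))     # True
print(configuration("the", "king"))             # segment-segment
print(is_complex("the", "king"))                # False
```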


6.3 Complex Tight Phrases 


6.3.1 Complex Suffixed Phrases 

There are 11 instances of complex suffixed phrases (for basic suffixed phrases, see §5.3). Two 
cases have a pronominal suffix as the first constituent of a union phrase (Jer 51:35 and Ezek 23:43). 
Phrase marker (6.2) for Jer 51:35 has the segment-phrase configuration. The complex suffix phrase 


5. A segment is, in fact, a single-constituent phrase. Rather than refer to one-constituent phrases and multiple- 
constituent phrases, we elect to refer to the former as segments and the latter as phrases. 
(n:work / suff) is formed by joining a segment (חמס ‘violence’) with a following union phrase (pers
pr/union / disj).⁶


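(6.2) [Phrase marker for Jer 51:35. Legible glosses in text order: ‘violence’, ‘and’, ‘flesh’, ‘me’. The noun ‘violence’ (n:work / suff) takes as its suffix a union phrase (pers pr) whose second member is the suffixed phrase ‘flesh’ + ‘me’ (n:body prt / suff).]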


The other nine instances each involve making the suffix the first constituent in an apposition 
phrase. Phrase marker (6.3) for Prov 23:15 is typical (note that alternate parses are also defensible). 


(6.3) [Phrase marker for Prov 23:15. Legible glosses in text order: ‘he will rejoice’, ‘heart’, ‘me’, ‘also’. The subject (sbj / gram) is the suffixed phrase ‘heart’ + ‘me’ (n:body prt / suff).]


The suffix is the first constituent of a segment-phrase configuration. The second constituent of the 
complex apposition phrase is a basic inverted modification phrase (the other instances are found in 
Deut 4:47; Jer 3:11; Ps 9:7, 49:14, 71:16, 103:2; 1 Chr 6:11, 25:7). 

6.3.2 Complex Definite Phrases 


Here, we list the number of definite complex phrases. Below, we shall provide an example of 
each phrase type. 


Phrase Type 
“Possessive” 


Prepositional Phrase 
Nominalized Clause 
Noun-Verb Participle 
Infinitive Construct 


Other Complex Noun Phrases 


6. The semantics of the basic suffix phrase consisting of שארי ‘my flesh’ is literal and thus is body part (body prt) 
rather than human. We have made no effort to provide special handling for figurative usages. 
6.3.2.1 The Bizarre Phrase Types: “Possessive” and Prepositional Phrase

This subtype of complex definite phrase consists of eight rather bizarre cases. We present two
examples. Phrase marker (6.4) shows a noun phrase from Josh 7:21, which translates literally as
‘the-tent-me’. This noun phrase is of the segment-phrase configuration.


(6.4) [Phrase marker for Josh 7:21: n:bldg / def joining ‘the’ with the suffixed phrase (n:bldg / suff) ‘tent’ + ‘me’.]


The other six “definite possessives” appear in Lev 27:23 (twice), Josh 8:33, 2 Kgs 15:16, Isa 24:2, 
and Mic 2:12. 

The second subtype is, so far as we know, unique to 1 Sam 9:24: a definite prepositional phrase
‘the-upon-her’. Phrase marker (6.5) is of the segment-phrase configuration.


(6.5) [Phrase marker for 1 Sam 9:24: ‘the’ joined with the prepositional phrase (upon + pron / prep) ‘upon’ + ‘her’.]


6.3.2.2 The Nominalized Phrase Types 

There are 998 complex definite phrases that involve nominalized predicative constructions. 
Nominalized predicative structures are not discussed until §8.4. As their name implies, they are 
noun phrases that contain predications. There are four types: nominalized clauses, noun-verb parti- 
ciples, infinitives construct, and other. We shall illustrate our points using an example of each type. 
All have the segment-phrase configuration. 

Nominalized Clauses. There are 51 clauses nominalized by a definite article. The nominal 
phrase in (6.6) from Gen 18:21 is an example. The clause (cl / oblq) consists of a finite verb (vb / gram) 
and a movement aim (mvt aim / gram). The NJPS translates the phrase ‘that has reached Me’. 


(6.6) [Phrase marker for Gen 18:21: nom / def joining ‘the’ with a nominalized clause (nom / nom). Glosses in text order: ‘the’, ‘she came’, ‘unto’, ‘me’.]


The critical point here is that the entire clause is nominalized (nom / nom). The definite article oper- 
ates upon the entire nominalized clause, making it definite. 

Noun-Verb Participles. There are 998 instances of this phenomenon. The nominal phrase from
Isa 62:6 in (6.7) shows a typical construction. As was explained in §3.2.4.2, noun-verb participles
are so called because they behave as nouns with regard to pre-context but as verbs with regard to
post-context. In the present example, the entire participle phrase is definite (as a noun might be),
but the participle has an explicitly marked direct object (such as a verb might have).


(6.7) [Phrase marker for Isa 62:6: nom ptc / def joining ‘the’ with a nominalized participle phrase (nom ptc / oblq). Glosses in text order: ‘the’, ‘mentioning’, [nota acc.], ‘Yahweh’.]


Infinitives Construct. In our markup of the text, there are only three instances of the phe- 
nomenon in Biblical Hebrew in which an infinitive construct is nominalized and made definite. 
Phrase marker (6.8) shows the infinitive construct in Jer 22:16. 


(6.8) [Phrase marker for Jer 22:16: nom inf / def joining ‘the’ with a nominalized infinitive phrase (nom inf / oblq). Glosses in text order: ‘the’, ‘know’, [nota acc.], ‘me’.]

Were there no explicitly marked direct object, דעת might well be classified as a noun.


Other Complex Noun Phrases. There are 72 other complex noun phrases. Most are quite 
straightforward, similar to the simple example from 1 Chr 24:6 in (6.9). 


(6.9) [Phrase marker for 1 Chr 24:6: n:humn / def. Glosses in text order: ‘the’, ‘scribe’, ‘from’, ‘the’, ‘Levite’.]


Here we have a basic definite phrase (“the Levite”) contained in a prepositional phrase contained in 
a modification phrase contained in a definite phrase. 


6.4 Complex Unconjoined Phrases 


Included among the complex unconjoined phrases are all manner of prepositional and nominal 
phrases that do not involve tight suffixation, definiteness, explicit conjoining via conjunctions, or 
implicit conjoining via juxtaposition. We consider five complex unconjoined phrase types:⁷


7. We recognize two other unconjoined phrase types: nominalized clauses and distributed apposition. The former 
involves a clause and is discussed in chap. 8. The latter involves non-tree behavior and is discussed in chap. 20. 
1. prepositional phrases,
2. construct phrases,
3. modification,
4. inverted modification,
5. bonded substantives.



The first two phrase types are consistent with traditional analyses and so will be dealt with only 
briefly. The final three are less well known and so will be treated in greater detail. 
6.4.1 Complex Prepositional Phrases 

A prepositional phrase can be complex because its preposition is compound (1,420 times), 
because its noun phrase is not a simple segment, because the preposition combines with a nominal- 
ized constituent, or any combination of the foregoing. We illustrate each situation. 
6.4.1.1 Compound Prepositions 

Phrase marker (6.10) from Exod 5:20 illustrates how a compound preposition inevitably yields 
a complex prepositional phrase. The prepositional phrase contains a joined phrase consisting of two 
prepositions. The phrase marker has a phrase-segment configuration: 


(6.10) [Phrase marker for Exod 5:20: from + humn / prep. The compound preposition (juxt) joins ‘from’ and ‘with’ and governs ‘Pharaoh’.]


6.4.1.2 Non-Segment Noun Phrase 
The most common instance of this phenomenon occurs when a preposition combines with a 
definite phrase in a segment-phrase configuration, as in phrase marker (6.11) from Gen 1:1: 


(6.11) [Phrase marker for Gen 1:1: obj mk + geog / prep. Glosses in text order: [nota acc.], ‘the’, ‘heaven’; the last two form a definite phrase (n:geog / def).]


6.4.1.3 Nominalized Constituent 

The predominant instances of this phenomenon are infinitive construct nominal phrases, as 
phrase marker (6.12) for Gen 2:5 illustrates. The infinitive construct (inf cst/gram) and its direct 
object (dir obj/gram) form a nominal predicative construction (nom inf/oblq), which combines 
with the preposition to form a prepositional phrase (to+inf cstr/prep). 


(6.12) [Phrase marker for Gen 2:5. Glosses in text order: ‘to’, ‘serve’, [nota acc.], ‘the’, ‘land’.]
(6.13) [Phrase marker for the prepositional-phrase portion of the direct object in Gen 17:23 (obj mk + humn, app, union/disj). Glosses in text order: [nota acc.], ‘Yishmael’, ‘son’, ‘his/him’, ‘and’, [nota acc.], ‘all of’, ‘born of’, ‘house’, ‘his/him’, ‘and’, [nota acc.], ‘all of’, ‘property of’, ‘silver’, ‘his/him’, ‘all of’, ‘male’, ‘in’, ‘men of’, ‘house of’, ‘Abraham’.]


6.4.1.4 Elaborations 

As has been noted previously, constructions can become very (disconcertingly?) elaborate. Con- 
sider the prepositional-phrase portion of the direct object from Gen 17:23 shown in (6.13). This 
phrase contains 40 nodes, is 22 segments long, and is 7 levels deep. 

More elaborate prepositional phrases exist. Readers with access to our database can examine 
some truly extensive prepositional phrases. For example, they can inspect the long prepositional 
phrase in the phrase marker for 1 Chr 28:11-15, a phrase 132 segments long and 14 levels deep. 
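
Measures of this kind (node count, length in segments, depth) are simple functions of a phrase-marker tree. The Python sketch below computes them for the small phrase in (6.9). The nested-tuple representation and the node labels are assumptions made for illustration. The conventions are also assumptions: every leaf is counted as a node, and depth is counted in edges from the root, so the figures need not agree with the way our database tallies nodes and levels.

```python
from typing import List, Tuple, Union

# A leaf is a gloss string; an internal node is (label, children).
Node = Union[str, Tuple[str, List["Node"]]]


def count_nodes(node: Node) -> int:
    """Count every node, leaves included (an assumed convention)."""
    if isinstance(node, str):
        return 1
    _, children = node
    return 1 + sum(count_nodes(child) for child in children)


def count_segments(node: Node) -> int:
    """Count the leaves, that is, the segments."""
    if isinstance(node, str):
        return 1
    _, children = node
    return sum(count_segments(child) for child in children)


def depth(node: Node) -> int:
    """Longest root-to-leaf path, counted in edges (an assumed convention)."""
    if isinstance(node, str):
        return 0
    _, children = node
    return 1 + max(depth(child) for child in children)


# 'the scribe from the Levite' (1 Chr 24:6), with illustrative labels.
scribe = ("def", ["the",
          ("mod", ["scribe",
           ("prep", ["from",
            ("def", ["the", "Levite"])])])])

print(count_nodes(scribe))     # 9
print(count_segments(scribe))  # 5
print(depth(scribe))           # 4
```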


6.4.2. Complex Construct Phrases 


6.4.2.1 Two-Relation Complex Construct Phrases 

If we limit a search to construct phrases that consist of a single-segment nomen regens plus any
other two-segment phrase (constructions consisting of a segment and a phrase), then we find 12,334
instances in Biblical Hebrew. The three top contributors with this structure consist of a construct
of a definite phrase (6,145 times or 49.8%; see [6.14]), a construct of a suffixed phrase (3,866 or
31.3%; see [6.15]), and a construct of a construct phrase (1,979 times or 16.1%; see [6.16]). This
leaves 344 instances (2.8%) not accounted for. We illustrate the three major phenomena via phrase
markers from Genesis 34.

Construct of Definite Phrase. Phrase marker (6.14) is a fragment from Gen 34:3: 


(6.14) [Phrase marker fragment from Gen 34:3: obj mk + humn / prep. Glosses in text order: [nota acc.], ‘the’, ‘girl’; the last two form a definite phrase (n:humn / def).]


Construct of Suffixed Phrase. Phrase marker (6.15) is a fragment from Gen 34:20: 


(6.15) [Phrase marker fragment from Gen 34:20: n:bldg / cstr. Glosses in text order: ‘gate of’, ‘city’, ‘them(m)’; the last two form a suffixed phrase.]


Construct of Construct Phrase. Consider the fragment from Gen 34:25 in (6.16): 


(6.16) [Phrase marker fragment from Gen 34:25. Legible glosses in text order: ‘two of’, ‘sons of’.]


If we require each “contained node” to have more than two daughters, then we find 629 instances 
of such coordinated phrases. Phrase marker (6.17) from Job 20:17 shows a simple conjoined phrase 
contained within a construct phrase. 


(6.17) [Phrase marker for Job 20:17: n:geog / cstr. Glosses in text order: ‘wadis of’, ‘honey’, ‘and’, ‘butter’; the last three form a union phrase (n:food, union/disj).]


6.4.2.2 More Elaborate Construct Phrases 

Construct phrases with an intricate structure are also common. For example, in phrase marker 
(6.18) for Exod 3:8, the nomen rectum is a coordinate phrase with six conjuncts, each a definite
phrase.⁸


8. Contrary to initial impressions, (6.18) is a tree. The four edges that cross other edges do so only because of 
layout engine flaws. Were the break-points moved to the right by appropriate amounts, there would be no crossing edges. 




(6.18) [Phrase marker for Exod 3:8: n:geog / cstr. The nomen regens ‘place of’ governs a union phrase (n:humn, union/disj) of six definite phrases joined by ‘and’: ‘the Canaanite’, ‘the Hittite’, ‘the Amorite’, ‘the Perizzite’, ‘the Hivite’, ‘the Jebusite’.]


Many complex construct phrases involve recursive structures. For example, the genealogy in 1 Chr
6:18-23 contains a very long structure, in which apposition phrases contain construct phrases over
and over (“A son of B son of C . . .”). This page is too narrow to show the phrase marker.


6.4.3 Complex Modification Phrases 

Of the modification phrases, 442 (11%) are simple and 3,644 (89%) are complex. Of the complex
phrases, 97% involve a modification node with an out-degree of two. The remaining 3% have an
out-degree greater than two. We show a phrase marker for each of the three main types of complex
modification phrases of out-degree two (in §5.4.3, we discuss basic modification phrases).
6.4.3.1 Segment-Phrase 

Segment-phrase types occur 1,617 times. The involved phrase is typically a prepositional phrase
(87%). To allow readers to exercise concepts and to make a point about unambiguous parsing,
phrase marker (6.19) for the final complete clause in 2 Kgs 14:2 is shown below.⁹ Note that the
complex modification phrase consists of the mother's name יהועדין ‘Yehoaddin’, modified by
מן ירושלם ‘from Jerusalem’ (the structure dominated by the sbj cmp / gram node). The modification of
the proper noun by the following prepositional phrase is indicated with confidence in this context.
There is nowhere else for the prepositional phrase to alight. In chap. 20, we discuss contexts with
syntactic ambiguity.
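(6.19) [Phrase marker for the final complete clause in 2 Kgs 14:2. Glosses in text order: ‘and’, ‘name of’, ‘mother’, ‘his/him’, ‘Yehoaddin’, ‘from’, ‘Jerusalem’. The subject (sbj / gram) is ‘name of’ + ‘mother’ + ‘his/him’; the subject complement (sbj cmp / gram) is the modification phrase ‘Yehoaddin’ + ‘from’ + ‘Jerusalem’.]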


6.4.3.2 Phrase-Phrase 

This construction type occurs 1,588 times. Many types of phrase occupy the lead position. 
Phrase marker (6.20) from 1 Sam 14:3 is typical. The construct phrase is modified by a preposi- 
tional phrase. 


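(6.20) [Phrase marker for 1 Sam 14:3. Legible glosses in text order: ‘priest of’, ‘in’, ‘Shiloh’. A construct phrase is modified by the prepositional phrase (in + geog / prep) ‘in’ + ‘Shiloh’.]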


6.4.3.3 Phrase-Segment 
This type of construction occurs only 266 times. The instance from Gen 24:16 in (6.21) should 
suffice. 


(6.21) [Phrase marker for Gen 24:16: n:qual / mod. Glosses in text order: ‘good of’, ‘vision’, ‘very’; the first two form a construct phrase.]


9. The Kethiv text is shown. The gap at the third position from the end of the clause exists because the Qere is 
not shown. 
Here a noun phrase having semantics of quality (n:qual) is built up by modifying (mod) a noun
phrase of quality, the latter having been licensed by the relation of construct (cstr).

6.4.4 Complex Inverted Modification Phrases¹⁰
6.4.4.1 Segment-Phrase 

This construction type occurs 623 times. Phrase marker (6.22) from 2 Kgs 23:33 is typical: 


(6.22) [Phrase marker for 2 Kgs 23:33: n:val / inv mod. Glosses in text order: ‘hundred’, ‘talent of’, ‘silver’; the last two form a construct phrase (n:val / cstr).]

6.4.4.2 Phrase-Phrase 


The phrase-phrase construction type occurs 218 times. Phrase marker (6.23) from Judg 20:17 
is typical. 


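(6.23) [Phrase marker for Judg 20:17: n:humn / inv mod. Glosses in text order: ‘four’, ‘hundreds’, ‘thousand’, ‘men’, ‘drawer of’, ‘sword’. The numeral phrase modifies an apposition phrase joining ‘men’ with the construct phrase ‘drawer of’ + ‘sword’.]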


6.4.4.3 Phrase-Segment 
The phrase-segment construction type occurs 541 times. Phrase marker (6.24) is typical, drawn 
from Amos 4:8. 


(6.24) [Phrase marker for Amos 4:8: n:geog / inv mod. Glosses in text order: ‘two’, ‘three’, ‘cities’; the two numerals form a joined phrase (crd / juxt).]


6.4.5 Complex Bonded Substantives 

One of the least frequently occurring and most specialized of the licensing relations is a type
that we refer to as bonding or bonded correlation. This relation licenses constructions that
approximate the form “not X, but Y” or “X and not Y.” Across Biblical Hebrew, there are 62 constituents
the formation of which is licensed in our parsing as involving bonding.

Often one part of the bonded construction is negated in an inverted modification structure, as in
the bonded “X and not Y” phrase from Jer 18:17 shown in (6.25).


10. See §5.4.4 for an explanation of “inverted.” 
(6.25) [Phrase marker for Jer 18:17: a bonded phrase with body-part semantics. Legible glosses in text order: ‘and’, ‘not’, ‘face’; ‘not’ + ‘face’ form an inverted modification phrase (n:body prt / inv mod).]


For another example, consider the bonded phrase “in X except in Y” in the prepositional phrase 
from 2 Kgs 5:15 that appears in (6.26). 


(6.26) [Phrase marker for 2 Kgs 5:15: in + geog / bnd, bonding a prepositional phrase labeled in + geog with one labeled in + humn. Legible glosses: ‘in’, ‘Israel’.]


6.5 Complex Structurally Defined Conjoined Phrases 


In taking up complex structurally defined conjoined phrases, we potentially enter a combinatoric
thicket, since the number of combinations of constituents and their “glue” can be enormous.
How this occurs is easy to see. Symbolize a segment by S, a phrase by P, a conjunction (usually ו
“and”) by +, and an absent conjunction by 0. Then we can concisely represent each of the possible
conjoined constructions. For example, a segment followed by a phrase joined by a conjunction is
symbolized by S+P. Three phrases joined by juxtaposition (with no conjunctions as glue) are
symbolized by P0P0P. And so on.
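
The notation lends itself to mechanical enumeration. The Python sketch below generates every pattern for a given number of constituents, writing an absent conjunction as the digit 0; the function name `patterns` is ours, chosen for illustration. With two choices for each constituent (S or P) and two for each piece of glue (+ or 0), n constituents yield 2^n × 2^(n-1) patterns.

```python
from itertools import product


def patterns(n: int) -> list:
    """List every S/P pattern with n constituents and n - 1 pieces of glue."""
    result = []
    for kinds in product("SP", repeat=n):
        for glue in product("+0", repeat=n - 1):
            pattern = kinds[0]
            for joiner, constituent in zip(glue, kinds[1:]):
                pattern += joiner + constituent
            result.append(pattern)
    return result


print(len(patterns(2)))         # 8
print(len(patterns(3)))         # 32
print("S+P" in patterns(2))     # True
print("P0P0P" in patterns(3))   # True
```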

Consider the construction types possible for a two-constituent phrase. The table specifies the 
subtypes and gives their incidences. There are 8 possible two-constituent subtypes. 


Phrase Complexity 


Complex 
P+P P+S S+S 
440 | 247| 39| 2,052 
Joined ius 
2,169 


With a three-constituent coordinated phrase, these 32 patterns are possible: 
Phrase Complexity 


Complex Basic 
P+S+S S+S+P | S+S+S 


Union 417 
S+S0P 
Mixed 
List S0S+P 
8 
S0S0P 

Joined 

102 4 


6.5.1 Compound Conjunctions by Juxtaposition 

Juxtaposed conjunctions acting as glue in coordination phrases are quite rare. We find only 13
juxtaposed conjunctions within phrases in Biblical Hebrew. Seven occur in tree phrase markers,¹¹
and 6 occur in non-tree phrase markers.¹² There are 11 instances of וגם ‘and also’. Phrase marker
(6.27) from Qoh 11:2 shows one of the ‘and also’ constructions.


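(6.27) [Phrase marker for Qoh 11:2: to + qty / union/disj. Legible glosses in text order: ‘to’, ‘seven’, ‘also’, ‘to’, ‘eight’. Two prepositional phrases (to + qty / prep) are joined by a compound coordinating conjunction (coord cj).]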


It is important to bear in mind that complex clauses allow for multiple acceptable parses. Consider
phrase marker (6.28) from 2 Chr 21:17. The complex coordination phrase was parsed to yield a
three-constituent structure P+P+P. One might argue with perhaps superior force that its final five
segments form a natural substructure and should be gathered to form a distinct coordination phrase:
בניו ונשיו ‘his sons and his wives’. Had we adopted this parse, the larger phrase would have had the
two-constituent structure P+P. The critical points are these: In the building up of complex phrases,
multiple equally valid parses are often possible. The consistency with which we have made parsing
decisions, especially in forming elaborate constructions, is uncertain. Only with the use and
adjustment of our database will consistent parsing of elaborate constructions finally be achieved.¹³


11. They are in Gen 6:4 and 14:7, Ezek 21:14, Qoh 11:2, Neh 2:18, 1 Chr 10:13, and 2 Chr 21:17. 

12. These occur in 2 Kgs 24:3-4; Zech 9:1-2; Dan 11:22; Neh 6:14, 13:15; and 2 Chr 21:4. 

13. The parsing of simple constructions should be fairly consistent, because these structures were created via com- 
putational methods and rarely have required adjusting by human over-readers. For a discussion of our approaches to 
(6.28) [Phrase marker for 2 Chr 21:17: n:thing / union/disj. Legible glosses in text order: ‘all of’, ‘the’, ‘acquisition’, ‘the’, ‘found’, ‘to’, ‘house of’, ‘also’, ‘sons’, ‘his/him’, ‘and’, ‘women’, ‘his/him’.]


Note the presence of the not-uncommon phenomenon of semantic heterogeneity: the first constitu- 
ent has "thing" semantics while the second and third have human semantics. 


6.5.2 Complex Union or Disjoint Phrases 


The number of constituents in a complex coordination phrase runs from 2 through 18. We will 
make no effort to catalog the union /disjoint phrases exhaustively. Rather, we will simply provide 
illustrative examples of the major subtypes. 

As our example of an extended coordinated phrase, consider (6.29) (p. 78), a 5-constituent 
coordinated phrase from Gen 15:9 with the structure P+P+P+S+S. 

The grand total of coordinated phrase subtypes possible for 2-18 constituents exceeds half a
million.¹⁴ Very few of these possibilities are realized. The table shows the subtypes attested for 10
constituents and higher in tree-structured union/disjoint phrase markers. Of the myriad possible
coordinated phrase subtype patterns, only 17 are attested. Being rare, the large coordinated phrases
only feebly sample the range of possibilities.
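
The grand total can be checked with a few lines of Python. The sketch assumes, as the tabulated figures for 17 and 18 constituents (131,072 and 262,144) suggest, that in a union/disjoint phrase every piece of glue is a conjunction, so that only the choice between segment and phrase varies and n constituents allow 2^n subtypes.

```python
def union_subtypes(n: int) -> int:
    """Number of S/P sequences of length n when every joiner is '+'."""
    return 2 ** n


# Sum over phrases having 2 through 18 constituents.
grand_total = sum(union_subtypes(n) for n in range(2, 19))

print(union_subtypes(17))                # 131072
print(union_subtypes(18))                # 262144
print(grand_total)                       # 524284
print(grand_total == 2 ** 19 - 2 ** 2)   # True
```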


computational parsing, see F. I. Andersen and A. D. Forbes, “Opportune Parsing: Clause Analysis of Deuteronomy 8,”
in Bible and Computer: Desk and Discipline (Paris: Honoré Champion Editeur, 1995) 49-75.
14. For the “numerically curious”: the possible grand total is $2^{19} - 2^{2} = 524{,}284$.
(6.29) [Phrase marker for Gen 15:9: n:anml / union/disj, with three apposition phrases (n:anml / app) followed by two segments. Glosses in text order: ‘heifer’, ‘3-year-old one’, ‘and’, ‘she-goat’, ‘3-year-old one’, ‘and’, ‘ram’, ‘3-year-old one’, ‘and’, ‘dove’, ‘and’, ‘pigeon’.]


Number of    | Possible | Attested | Number of |
Constituents | Subtypes | Subtypes | Instances | Instance References
10 | 1,024   |   | 5 | Gen 15:19-21; Num 3:36-37; 1 Chr 9:36-37; 2 Chr 4:19-22, 31:13
11 | 2,048   |   | 9 | Gen 10:15-18; Exod 30:26-28; Josh 7:24, 15:48-51; 2 Sam 5:14-16; 1 Chr 1:13-16, 4:28-31, 8:22-25, 28:11-15
12 | 4,096   |   | 3 | Josh 18:21-24; 1 Kgs 7:48-50; Jer 48:21-24
13 | 8,192   |   | 4 | Gen 10:26-29; Josh 19:18-21; 1 Chr 1:20-23, 4:34-37
14 | 16,384  |   | 3 | Josh 19:2-6; 1 Sam 30:27-31; 2 Sam 17:28-29
15 | 32,768  | 1 | 1 | Exod 35:5-9 [pattern: S(+S)…+P(+S)…(+P)…]
16 | 65,536  | 0 | 0 | (none)
17 | 131,072 | 1 | 1 | Josh 19:41-46 [pattern: S(+S)¹⁵+P]
18 | 262,144 | 1 | 1 | Exod 31:7-11 [pattern: P(+P)¹⁷]


6.5.3 Complex Mixed Lists 


There are far more ways to realize a mixed list involving n constituents than there are ways
to realize the corresponding union/disjoint construction or joined construction. The 6-constituent
phrase marker (6.30) from Neh 9:8 is typical. Its pattern is P0P0P+P+P+P. Since only 647 instances
of mixed phrases are attested in Biblical Hebrew, it is not surprising that few of the lengthy
subtypes are attested and that of the subtypes attested, very few are seen more than once. The table at
the bottom of the page shows the number of times that mixed lists of a given length are encountered.
Basic and complex phrases are included.
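
Given a pattern in the S/P notation, the three structurally defined types differ only in their glue: all conjunctions (union), no conjunctions (joined), or a mixture (mixed list). A minimal Python sketch of that classification follows; the function names are ours, and an absent conjunction is written as the digit 0.

```python
def glue_of(pattern: str) -> list:
    """Extract the joiners ('+' or '0') from a pattern such as 'P0P+S'."""
    return [ch for ch in pattern if ch in "+0"]


def constituent_count(pattern: str) -> int:
    """A pattern with k joiners has k + 1 constituents."""
    return len(glue_of(pattern)) + 1


def classify(pattern: str) -> str:
    """Name the structurally defined type implied by the glue."""
    glue = glue_of(pattern)
    if not glue:
        raise ValueError("a conjoined phrase needs at least two constituents")
    if all(joiner == "+" for joiner in glue):
        return "union"
    if all(joiner == "0" for joiner in glue):
        return "joined"
    return "mixed list"


print(classify("P0P0P+P+P+P"), constituent_count("P0P0P+P+P+P"))  # mixed list 6
print(classify("P+P+P+S+S"), constituent_count("P+P+P+S+S"))      # union 5
print(classify("P0P0P"), constituent_count("P0P0P"))              # joined 3
```

Of the 2^(n-1) possible glue sequences for n constituents, exactly one is all conjunctions and one is all juxtaposition; the remaining 2^(n-1) - 2 are mixed, which is why mixed lists have the most possible realizations once n exceeds 3.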


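(6.30) [Phrase marker for Neh 9:8: n:humn / mix list, with six definite phrases (n:humn / def). Glosses in text order: ‘the Canaanite’, ‘the Hittite’, ‘the Amorite’, ‘and’, ‘the Perizzite’, ‘and’, ‘the Jebusite’, ‘and’, ‘the Girgashite’.]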


The four longest mixed lists are in 1 Chr 11:26-47 (45 constituents), Neh 10:15-28 (44 con- 
stituents), Exod 35:11-19 (41 constituents), and Exod 39:33-41 (41 constituents). 


Child Count | Incidence | Child Count | Incidence
4  | 333 | 13  |
5  | 55  | 14  |
6  | 87  | 15  |
7  | 30  | 16  |
8  | 46  | 17  |
9  | 17  | 18  |
10 | 16  | 19  |
11 | 7   | 20  |
12 | 9   | >20 |
6.5.4 Complex Joined Phrases 

The final complex structurally defined conjoined phrase is the joined phrase. Recall that this 
is the type of complex phrase with constituents (at least one of them is itself a phrase and none of 
them is a conjunction) that gather to form a conjoined construction licensed by juxt. There are 806 
of these phrases. 

There is evidence that lengthy joined phrases achieve integrity through identity of contained 
constituent type (segment or phrase) and through identity of constituent structure labeling. For the 
55 joined phrases having 5-35 constituents, we find: 


1. With two exceptions, they are “monotypic.” That is, they are either all segments or all
phrases.¹⁵

2. For 40 of the 49 phrases (82%), the structure labeling is invariant. 70% of these invariant 
labels involve human semantics. 


The 9 phrases exhibiting structure-label variation are an interesting group. For example, among the 
25 seven-constituent or longer phrases, only Ps 148:7-12 exhibits variation. Here a joined vocative 
mostly consists of ‘and’-joined pairs praising Yahweh: sea creatures and ocean depths, fire and hail, 
snow and cloud, mountains and hills, fruit trees and cedars, animals and cattle, creepers and birds, 
kings and peoples, princes and judges, young men and young women, elders with youths. 

The incidences of complex joined phrases are as shown below. 


[Table: incidence of complex joined phrases by child count; not recoverable.]


The four longest complex joined phrases are in Ezra 2:43-54 (35 constituents), Neh 7:46—56 (32), 
Josh 12:9-24 (31), and 2 Sam 23:24-39 (31). 

The 5-constituent phrase marker (6.31) (p. 81) from Josh 10:5 is typical of the complex 
joined phrases. 


6.6 Complex Semantically Defined Conjoined Phrases 


The distinctions between structurally defined and semantically defined conjoined phrases were 
presented in §5.6. There is no need to expand on the materials presented there.


6.6.1 Complex Echo Phrases 


      Basic echo phrases were discussed in some detail in §5.6.1. All we need to do here is provide
a brief survey of complex echo phrases and show a parade example. We find 61 complex echo 


15. In Neh 7:66, the 5-conjunct number “42,300” has the pattern SOSOSOSOP. In 2 Sam 23:32, the 18th conjunct is
a segment, unlike the other 30 phrasal conjuncts. The parallel in 1 Chr 11:34 suggests that בן ‘son of’ has been lost from
the text of Samuel. Its restoration in 2 Sam 23:32 would recover the expected all-P pattern from the present very unusual
POPOPOPOPOPOPOPOPOPOPOPOPOPOPOPOPOSOPOPOPOPOPOPOPOPOPOPOPOPOP pattern.

(6.31)

n:humn / juxt
    n:humn / cstr    king of Jerusalem
    n:humn / cstr    king of Hebron
    n:humn / cstr    king of Yarmut
    n:humn / cstr    king of Lakish
    n:humn / cstr    king of Eglon

phrases in Biblical Hebrew. Fifty-two are represented by tree phrase markers and 9 by non-tree. 
There is only one complex triple echo, in Jer 7:4, as shown in phrase marker (6.32). 

(6.32)

n:bldg / echo
    n:bldg / cstr    temple of Yahweh
    n:bldg / cstr    temple of Yahweh
    n:bldg / cstr    temple of Yahweh


6.6.2 Complex Apposition Phrases 


Apposition was introduced in §5.6.2. Below we provide the tallies for the apposition phrases 
classified by child count. Both tree and non-tree phrase markers are included. 


[Table: counts of complex and basic apposition phrases by child count; not recoverable.]


      Many of the complex apposition phrases include nominalized constituents, be they nominalized
clauses (nom / nom), nominalized infinitives (nom inf / oblq), or noun-verb participles (nom
ptc / oblq). Nominalized constructions are not discussed in this book until chap. 8. We expect that
readers will grasp the following material nevertheless. If not, they should consult the glossary or
sample chap. 8 and then return to this point. Phrase marker (6.33) from Prov 21:24 shows a
phrase-terminal nominalized clause.


(6.33) a sbj n:qual n:qual : 
oblq gram app , app ie Tt insolent 
Vi? haughty 
nom cl sbj cmp 
nom Oe — P scoffer 
sbj n:abst -— 
gram suff DV name 
N i his/him 


sbj cmp nom ptc n-v ptc ied : 
gram oblq gram "wp making 
mnr in * mntl (3 3 
gram prep 3 in 
n:mntl 


————————5 muy wrath of 


:]i71 insolence 


As the following figures reveal, nominalized constituents have a strong tendency to come at the end 
of complex apposition phrases. 


[Table: number of phrase-initial and phrase-terminal nominalized constituents by child count; not recoverable.]


In Biblical Hebrew, nominalized constituents in complex apposition phrases are almost 75 times 
more likely to be phrase-terminal than to be phrase-initial. 

When addressed at all, the traditional treatment of constructions such as we are examining here 
is to make a general statement (to the effect that nominalized constituents in apposition phrases [al- 
most always] come at the end of the phrase), provide a few examples, and leave the matter. Having 
all of the data analyzed allows us to investigate atypical instances and question why they are the 
way they are: are the exceptions clustered in the text; does their incidence seem to be controlled by 
identifiable factors; is their occurrence associated with genre; is it random? 

      Phrase marker (6.34) from 1 Kgs 7:48 shows a nominalized clause (nom / nom) followed by a
single segment in apposition to the definite phrase (n:furn / def) leading off the apposition trio. Why
is this phrase structured this way? We note that 1 Kings has the largest number of these atypical
instances of any book, seven in all.


(6.34) n:furn n:furn j
app def mm n the 
mw table 
nom "UN which 
nom 7o 
cl loc upon + pron P 
oblq gram prep ue y upon 
1 his/him 
sbj n:food 
gram str — Dr? bread of 
n:body prt , 
def —— 0 the 
"PE face 
:ant gold 


6.6.3 Strictly Subsetting Phrases 
The final trio of related complex phrases consists of: 


1. strictly subsetting phrases (210x), 
2. strictly supersetting phrases (185x), 


3. nested phrases (16x). 


These phrase types are investigational and are to date incompletely identified in our phrase
markers.¹⁶ Whether they prove worth completing or whether they should revert to structurally defined
phrase types remains to be seen. They are semantic replacements for the union/disjunct, mixed
list, and joined phrase types, relying on the hypernym / hyponym (superclass / subclass) distinction.
A strictly subsetting phrase is made up of constituents the order of which runs from most general
to least. In this type of phrase, hyponym follows hypernym. For example, the first constituent of
phrase marker (6.35) from Gen 24:10 specifies the land, and the second specifies the city, a subset
of the former. Num 9:11 has a three-child subsetting consisting of month, day, and time of day
(“evening”).


(6.35)

to + geog / subset
    to + geog / prep    unto Aram-Naharayim
    to + geog / prep    unto city of Nahor


6.6.4 Strictly Supersetting Phrases


In strictly supersetting phrases, hypernym follows hyponym. The order runs from least general 
to most. Phrase marker (6.36) from Judg 10:1 conveys the structure. Here, city precedes region. 


16. Hence the counts provided above are tentative at best.

(6.36)

in + geog / superset
    in + geog / prep    in Shamir
    in + geog / prep    in mountain of Ephrayim


6.6.5 Nested Phrases 

When there are three or more immediate phrasal constituents, the constituents might form a 
hypernym / hyponym set, but the ordering may be neither subsetting nor supersetting. We refer to 
these phrases as nested. Phrase marker (6.37) from Deut 34:6 shows this sort of structure. Here we 
have valley, land, city—a non-monotonic sequence. 


(6.37)

in + geog / nested
    in the valley
    in land of Moab
    in front of Beth-Peor


6.7 Brief Summary 


Complex Phrases. A complex phrase involves more than one licensing relation. This chapter 
provides a census, with example phrase markers, of the various kinds of complex phrases attested 
in Biblical Hebrew. 

      Complex Tight Phrases. Unlike the very rare complex suffixed phrases (10x) and the few
bizarre complex definite phrases (8x), there are many “well-formed” complex definite phrases
(1,107x), the great majority involving noun-verb participles (89%).

      Complex Unconjoined Phrases. Complex unconjoined phrases are of five sorts: prepositional
phrases (“from the Levite”), construct phrases (“two-of sons-of Jacob”), modification phrases
(“priest-of Yahweh in Shiloh”), inverted modification phrases (“hundred talent-of silver”), and
bonded substantives (“neck and not face”).

      Complex Structurally Defined Conjoined Phrases. Taking up complex structurally defined
conjoined phrases lands us in a combinatoric nightmare since there are a great many possible
combinations of segments, phrases, and connectives. In practice, however, few are realized. As with
basic structurally defined conjoined phrases, there are three sorts: union phrases (“heifer
three-year-old one and she-goat three-year-old one”), mixed lists (“the Canaanite, the Hittite, the
Amorite, and the Perizzite”), and juxtaposed lists (“king of Jerusalem, king of Hebron, king of
Yarmut, king of Lakish, king of Eglon”).

      Complex Semantically Defined Conjoined Phrases. Complex semantically defined conjoined
phrases are of five sorts: echo phrases (“Temple of Yahweh, Temple of Yahweh, Temple of
Yahweh”), apposition phrases (“David the king”), strictly subsetting phrases (“in the year the second,
in the month the second, in twenty in the month”), strictly supersetting phrases (“in day of twenty
and four to the month, in the sixth, in year of two to Darius the king”), and nested phrases (“in the
month the first, in the year the second, in one to the month”).


Chapter 7 


Main Clauses 


We expound on clauses by characterizing and providing examples of main clauses (chap. 7) and 
embedded clauses (chap. 8). 


7.1 The Clause 


7.1.1 Our Previous References to the Clause 

Assuming that readers came to our book with a serviceable concept of the clause, we have 
already used the word clause about 180 times. We have now reached the point where we can and 
should explain what we mean by a clause. Our glossary entry for clause reads: 


clause—Typically, a syntactic unit that includes a predicator and the clause immediate con- 
stituents that accompany it. In Biblical Hebrew, a subject plus some other constituent(s) can 
constitute a clause that has no predicator (traditionally called “nominal”; in recent literature,
“verbless” or “nonverbal”).¹


There is characteristically a need to define the foundational terms of an area using other founda- 
tional terms from the same area. This is due to the fact that the terms together form an interrelated 
system. For example, readers who looked up our glossary entry for clause, provided above, may 
have felt the need to consult a fair number of additional glossary entries: predicator, verbal, qua- 
siverbal, clause immediate constituent, constituent, subject, segment, and so on. 


7.1.2 Definitions of "Clause" in the Literature 

We see the interrelated-system phenomenon also in the definitions of the clause offered in tra- 
ditional books on the syntax of Biblical Hebrew: 

A clause normally consists of a subject and a predicate. Depending on whether the predicate is a noun
or a verb, a clause is said to be nominal or verbal.²


In this grammar a clause is regarded as a meaningful series of words that has a subject and a predicate.³


1. As shown in chap. 19, some “clauses” lack both predicators and subjects! The terms in underlined Eurostile in 
our definition have all been introduced by now, so our definition should be intelligible to readers. 

2. Paul Joüon and T. Muraoka, A Grammar of Biblical Hebrew (rev. Eng. ed.; Rome: Pontifical Biblical Institute, 
2006) 525. 

3. Christo H. J. van der Merwe, Jackie A. Naudé, and Jan H. Kroeze, A Biblical Hebrew Reference Grammar
(Sheffield: Sheffield Academic Press, 1999) 355.

A clause is a set of words with a subject, a predicate, and any words that modify them. The subject or
predicate can be implied rather than stated, and the modifiers can be absent.⁴


We also observe the interrelated-system phenomenon in reference works on general linguistics: 


clause A clause is a phrase that includes a predicate and all of its arguments and modifiers.⁵


clause (n.) A term used in some models of grammar to refer to a unit of grammatical organization
smaller than the sentence, but larger than phrases, words, or morphemes.⁶


clause . . . Any constituent dominated by the initial symbol S, particularly one which forms part of a
larger structure. Clauses are conventionally divided into main clauses and subordinate clauses.⁷


None of these definitions inclines us to alter our glossary definition, given above. 


7.1.3 Concepts Relevant to Clause Characterization

      What is needed, however, is the explanation, integration, and extension of a concept that was
mentioned previously with its discussion deferred until now: configurational versus
nonconfigurational languages (mentioned in §1.3.3, Item #3, p. 12). We now take up this concept,
discussing its relation to the classification and characterization of the clauses and clause
immediate constituents of Biblical Hebrew.


7.2 Configurational versus Nonconfigurational Languages 


      According to Crystal, configurational languages have “a fairly fixed word-order and
hierarchical constituent structure”⁸ while nonconfigurational languages exhibit “fairly free word-order and
seemingly ‘flat’ constituent structure.”⁹ The distinction was introduced into linguistics by Kenneth
Hale.¹⁰ As David Golumbia convincingly argues,¹¹ the existence of nonconfigurational languages
represented a serious impediment to the transformationalists’ quest for Universal Grammar. As a
result, the nature and frequency of nonconfigurational languages were and remain contentious areas
of research. In the following subsections, we will answer two questions:


1. Is Biblical Hebrew a nonconfigurational language? 
2. If it is nonconfigurational, what difference does this make to syntactic description? 


4. Ronald J. Williams and John C. Beckman, Williams’ Hebrew Syntax (3rd ed.; Toronto: University of Toronto 
Press, 2007) 172. 
5. Ivan A. Sag, Thomas Wasow, and Emily M. Bender, Syntactic Theory: A Formal Introduction (2nd ed.; Stan- 
ford, CA: CSLI, 2003) 557. 
6. David Crystal, A Dictionary of Linguistics and Phonetics (5th ed.; Oxford: Blackwell, 2003) 74. 
7. Robert L. Trask, A Dictionary of Grammatical Terms (London: Routledge, 1993) 44. This definition strikes us 
as minimally helpful. 
8. Crystal, A Dictionary, 95. 
9. Ibid., 315. Elsewhere (p. 182), Crystal explains that “sentences have a flat structure if they lack the [noun 
phrase-verb phrase] configuration.” 
10. Kenneth Hale, “Warlpiri and the grammar of non-configurational languages,” Natural Language and Linguistic 
Theory 1 (1983) 5-47. 
11. David Golumbia, “The interpretation of nonconfigurationality,” Language and Community 24 (2004) 1-22.

Hale characterized nonconfigurational languages as involving: “(i) free word order, (ii) the use of
syntactically discontinuous expressions, (iii) extensive use of null anaphora.”¹²

      The data discussed below indicate to us that Biblical Hebrew exhibits Hale’s three defining
characteristics. Saying this, however, does not mean that we accept any “non-configurationality
program.” Givón has registered important cautions regarding “so-called non-configurationality.”¹³


7.2.1 Free Word Order? 

      Crystal makes the difference between fixed and free word-order languages quite clear: “Fairly
fixed word-order” languages “rely on word-order as a means of expressing grammatical
relationships within constructions.”¹⁴ Crystal’s distinction can be refined by accepting Steinberger’s
observation that “[m]ost languages known as free word order languages are in fact languages with
partially free word order . . . , or rather free phrase order. . . .”¹⁵

Does Biblical Hebrew exhibit partially free phrase order, or in our parlance, partially free CIC 
order? Let us first consider a concrete example and then examine the ordering of the three major 
clause constituents across all of Biblical Hebrew. 
7.2.1.1 The Ordering of S, V, O, and B (“Beneficiary”) CICs in עשה Clauses

      When we search Biblical Hebrew for all clauses containing a subject (S), a direct object (O), a
“beneficiary” CIC (B), and an עשה-based finite verb (V), we find 21 clauses. In the table below,
we provide an example of each of the eight different orderings of S, V, O, and B attested.


Pattern | Citationᵃ
 | Num 6:11
 | Gen 3:11
 | Josh 5:3
 | 2 Chr 4:16
 | 2 Sam 7:11
 | Neh 8:16
 | Exod 36:6
 | Ezek 12:3

[The Pattern column, which gives the ordering of S, V, O, and B together with any interpolated constituents, is not recoverable.]


a. Each verb-initial clause shown involves a sequential finite verb. The interpolated constituents represented in 
lowercase are: neg = negative, voc = vocative, tm int = time interval, ind obj = indirect object, src = source of supply, 
and loc = location (see the glossary for definitions). 


12. Hale, “Warlpiri and the grammar of non-configurational languages,” 5. More-recent references characterize
configurational languages along complementary lines. For example, Trask (A Dictionary, 55-56) asserts that
configurational languages “are usually characterized by fairly rigid word order and by the infrequency of discontinuous
constituents.”

13. Talmy Givón, Syntax: An Introduction (rev. ed.; Amsterdam: Benjamins, 2001) 279-83. 

14. Crystal, A Dictionary, 503. 

15. Ralf Steinberger, “Treating ‘Free Word Order’ in Machine Translation,” Proceedings of the 15th Conference on
Computational Linguistics (Stroudsburg, PA: Association for Computational Linguistics, 1994) 69.

16. We dismiss from consideration a clause in 1 Sam 20:2 purporting to have the pattern “loc B V S O” because
the beneficiary CIC is forced due to our use of the Kethiv text.

Out of a possible 24 permutations of 4 constituent types, only 8 are attested. Three of the 4
constituents are attested at each clause position. This example exhibits partially free CIC ordering.
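
The figure of 24 is simply 4! (the number of orderings of four distinct constituent types). As a minimal sketch, the following Python lines enumerate the orderings and express the attested share. Only the counts (24 possible, 8 attested) come from the discussion above; the variable names are illustrative.

```python
from itertools import permutations

# All conceivable orderings of the four constituent types S, V, O, and B.
orders = ["".join(p) for p in permutations("SVOB")]

attested = 8  # number of distinct orderings reported above

print(len(orders))                      # 24
print(f"{attested / len(orders):.1%}")  # 33.3%
```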


7.2.1.2 The Ordering of S, V, O in the Clauses of Biblical Hebrew 
      Next, consider the ordering of S, V, and O across all of the clauses of Biblical Hebrew.¹⁷


SVO | VSO | OSV
2,420 (33.5%) | 3,085 (42.7%) | 166 (2.3%)

SOV | VOS | OVS
145 (2.0%) | 1,043 (14.4%) | 364 (5.0%)


For these data, the oft-repeated assertion that Biblical Hebrew is a VSO language is an insufficient 
description. The VSO sequence occurs in only 43% of the clauses containing an S, a V, and an O. 

      In passing, consider ordering in the subset of these sorts of clauses having unanchored
predicators (see §11.2). For these, our tallies yield:


SVO | VSO | OSV
2,387 (44.8%) | 1,611 (30.2%) | 166 (3.1%)

SOV | VOS | OVS
143 (2.7%) | 662 (12.4%) | 364 (6.8%)


These clauses are predicator initial (“V-initial”) 43% of the time.¹⁸
      Biblical Hebrew does exhibit substantial free CIC ordering. Hale’s characteristic (i), “free
word order,” is true of Biblical Hebrew.
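
The percentages in the two tallies can be re-derived from the raw counts. The short Python sketch below does so; the counts are those tabulated above, while the dictionary and function names are ours and purely illustrative.

```python
# Raw counts of S/V/O orderings, as tabulated above.
all_clauses = {"SVO": 2420, "VSO": 3085, "OSV": 166,
               "SOV": 145, "VOS": 1043, "OVS": 364}
unanchored = {"SVO": 2387, "VSO": 1611, "OSV": 166,
              "SOV": 143, "VOS": 662, "OVS": 364}


def shares(counts):
    """Percentage share of each ordering, rounded to one decimal place."""
    total = sum(counts.values())
    return {order: round(100 * n / total, 1) for order, n in counts.items()}


def v_initial_share(counts):
    """Percentage of clauses whose first constituent is V (VSO plus VOS)."""
    total = sum(counts.values())
    return round(100 * (counts["VSO"] + counts["VOS"]) / total, 1)


print(shares(all_clauses)["VSO"])   # 42.7
print(shares(unanchored)["SVO"])    # 44.8
print(v_initial_share(unanchored))  # 42.6
```

The last figure corresponds to the 43% predicator-initial incidence stated for clauses with unanchored predicators.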


7.2.2 Syntactically Discontinuous Expressions? 
In §4.2, the third of the four conditions that a tree-structured phrase marker must obey was:


3. Non-tangling Condition. Edges may not cross. 


When edges are tangled (i.e., when they inescapably cross one another), then the underlying
constituents are said to exhibit discontinuity.²⁰ Every phrase marker shown until now, with one
exception, has been a tree and has not exhibited tangling.²¹ This is because we have selected phrase
markers to make it so. But consider phrase marker (7.1) from Ruth 1:1b.

      At issue here is the extent of the subject of the clause. There is concord between the finite
verb segment וילך and the following segment איש, which clearly is at least part of the subject. The
problem is what to do with the trailing coordinate phrase: הוא ואשתו ושני בניו. The segment הוא in


17. In obtaining these tallies, we set aside indirect objects, complements, objects and subjects of address, and
peripheral elements. See §11.3.2.5.

18. When we examine clauses with an unanchored predicator plus either a subject or an object, initial-predicator
incidence increases substantially. The VO sequence occurs in 80% of cases, and the VS sequence occurs in 62% of cases.

19. We say more about all this in §11.3, where we take up positional syntax.

20. For example, in the clause “Wake him up,” one analysis takes the finite verb “wake up” to be rendered
discontinuous by the interposed direct object “him.”

21. Because of imperfections in graphic layout, some phrase markers contain edges that cross one another but that
would not do so were the layout better. The single tangled phrase marker is at the end of §3.2.4, phrase marker (3.6).

this coordinate phrase is co-referential with איש, and so they might each be given an index to
indicate identity of reference, yielding אישᵢ and הואᵢ. This is a standard technique for identifying items
without having to introduce crossing edges. But the coordinate phrase (“he and his wife and his
two sons”) is not co-referential with איש, so this simple artifice is not immediately available. Since
we are not constrained to use only trees for our phrase markers, we rely on tangling in situations
such as this. We dominate both איש and הוא ואשתו ושני בניו with a node showing that they combine
to form a noun phrase having human semantics, the semantic licensing relation between the two
dominated constituents being that of superset (n:humn / superset). That is to say, הוא ואשתו ושני
בניו refers to a superset of the referent of איש. Note that we have relied on semantic relations in the
formation of our phrase markers.
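
The notion of discontinuity at work here can be stated operationally: a constituent is discontinuous when the leaves it dominates do not occupy a contiguous stretch of the clause. The Python sketch below illustrates this under a deliberately simplified, hypothetical indexing of the gloss-level units of Ruth 1:1b; the indexing and the function name are our assumptions, not the representation used for the actual phrase markers.

```python
def is_discontinuous(leaf_positions):
    """Return True if the leaf positions do not form one contiguous span."""
    positions = sorted(leaf_positions)
    return positions != list(range(positions[0], positions[-1] + 1))


# Hypothetical, simplified indexing of Ruth 1:1b (glosses only):
# 0 and | 1 he-walked | 2 man | 3 from-Bethlehem-of-Judah
# 4 to-sojourn | 5 in-field-of-Moab | 6 he-and-his-wife-and-his-two-sons
subject = {2, 6}        # 'man' plus the trailing coordinate phrase
movement_origin = {3}   # 'from Bethlehem of Judah'

print(is_discontinuous(subject))          # True
print(is_discontinuous(movement_origin))  # False
```

On this simplified view, the subject is discontinuous because positions 3 through 5 intervene between its two parts, which is what forces crossing edges in the phrase marker.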


7.1 cl dl seq w ^ 
( ) oblq NN gram | and 
vb > 
gram T» he walked 
sbj n:humn k 
gram superset WR man 


H from 


mvt orig from + geog 
— 
gram re 


NNS ngeog ss 
cstr —— bo 

nnm Judah 
aim to +inf estt O —— ooa 5 


gram prep to 


scorn 


loc in + geog 
gram prep 


——————— "à in 


field of 


Moab 


pers pr 


union/disj he 


and 


woman 


his/him 


and 


two of 


sons 


his/him 


      In addition to the introduction of indexation, other strategies for avoiding overt representation
of discontinuity have been devised over the years. Chomsky introduced “a second level of
description defined by transformations to account for facts which his predecessors had employed
discontinuity to handle.”²² Another avoidance strategy “assumes that noncontiguous syntactic elements
cannot form a syntactic constituent, although they may be mapped onto a semantic representation

22. Geoffrey J. Huck and A. E. Ojeda, Syntax and Semantics: Discontinuous Constituency (Orlando, FL:
Academic, 1987) 3.

in which their translations form a unit.”²³ Our approach strives to supply a What-You-See-Is-What-
You-Get representation, avoiding these sorts of multiple levels of representation.

Discontinuity is also associated with a second phenomenon: distributed apposition, wherein a 
later constituent is in apposition to two or more earlier constituents considered together. Consider 
the direct object clause immediate constituent (CIC) in (7.2) from 1 Sam 31:2. 


(7.2)  dirobj obj mk + humn obj mk * humn 
gram union/disj prep 


TR [nota acc.] 


n:humn PEN 
—————— 

dst app Inm —Yehonatan 

^ and 


obj mk * humn TR [nota acc.] 


Ss ITIN Abinadab 


^ and 


obj mk * humn 


prep TIN [nota acc.] 


n:humn 


dst app ywo  Malkishua 


n:humn 


cur 43 sons of 


lc SV Saul 
The referent of ‘Yehonatan’, ‘Abinadab’, and ‘Malkishua’ taken together is ‘sons of Saul’. The
resulting constituents transgress the Non-tangling Condition.²⁴ Syntactic representation of Biblical
Hebrew involves discontinuous constituents. For a table showing the relative incidence of
discontinuity as a function of licensing relation, see §20.1.2. There we see that discontinuity occurs in 9.8%
of the instances of “interruptable constituents.”²⁵ Hale’s characteristic (ii), “the use of syntactically
discontinuous expressions,” is true of Biblical Hebrew.
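
Distributed apposition also makes a phrase marker a graph rather than a tree, because one node acquires several mothers (see n. 24). A minimal Python sketch of such a check follows. The edge list is a hypothetical simplification of phrase marker (7.2), with node names invented for illustration; it is not the encoding used in the underlying database.

```python
from collections import defaultdict


def multi_mother_nodes(edges):
    """Map each node that has more than one mother to its sorted list of mothers."""
    mothers = defaultdict(set)
    for mother, child in edges:
        mothers[child].add(mother)
    return {child: sorted(m) for child, m in mothers.items() if len(m) > 1}


# Hypothetical (mother, child) edges for a simplified version of (7.2).
edges = [
    ("dir obj", "app:Yehonatan"),
    ("dir obj", "app:Abinadab"),
    ("dir obj", "app:Malkishua"),
    ("app:Yehonatan", "Yehonatan"),
    ("app:Abinadab", "Abinadab"),
    ("app:Malkishua", "Malkishua"),
    # The shared appositive is dominated by all three apposition nodes.
    ("app:Yehonatan", "sons of Saul"),
    ("app:Abinadab", "sons of Saul"),
    ("app:Malkishua", "sons of Saul"),
]

print(multi_mother_nodes(edges))
# {'sons of Saul': ['app:Abinadab', 'app:Malkishua', 'app:Yehonatan']}
```

An empty result would indicate that the Single-Mother Condition holds for the edge list supplied.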
7.2.3 Null Anaphora? 
According to Hale, the third characteristic of nonconfigurational languages is “extensive use of
null anaphora,” by which he refers “to constructions in which an argument (i.e., subject or object
of a verb) is not represented by an overt nominal expression in [Phrase Structure].”²⁶


23. Ibid., 5. 

24. Since the n:humn /cstr node has three mother nodes, this phrase marker also disobeys the Single-Mother Condi- 
tion for trees: any non-root node has only one mother. In Biblical Hebrew, distributed apposition is uncommon but real. 
It occurs 155 times. 

25. The following licensing relations intrinsically do not permit discontinuity: definite, preposition, suffixation, and 
(with 9 exceptions) construct. 

26. As reported by Golumbia, “The interpretation,” 6.

7.2.3.1 Null Subjects

      With regard to the subject, there is no question that Biblical Hebrew exhibits extensive null
anaphora since it “is a prototypical example of a pro-drop language.”²⁷ As Cote puts it: “[I]n
languages with ‘strong’ inflection, the subject case may be absorbed by the inflective morphology,
eliminating the need for an overt subject.”²⁸

      This phenomenon is very common in Biblical Hebrew. For example, we find 2,272 clauses in
Biblical Hebrew that consist of only a finite verb. In this book, phrase markers for clauses lacking
overt subjects are, among others, (3.2), (4.2), (4.9), and (5.3). Recall that our definition of clause
does not stipulate that it must include an overt subject.

7.2.3.2 Null Objects


Non-overt direct objects in Biblical Hebrew do exist. Consider the case of עשו. This imperative
verb form occurs 30 times in Biblical Hebrew, 21 times with a direct object. Thus, it is typically
transitive. Four times the clause in which the imperative occurs involves a comparison (Gen 19:8;
Jer 26:14, 50:15, 50:29), and three times the imperative is part of a set expression (Hag 2:4; 2 Chr
19:7, 19:11), חזק . . . ועשו “be strong and do!” (twice) and שמרו ועשו “keep and do!” (once). The
remaining two instances (2 Sam 3:18 and 1 Kgs 18:25) involve non-overt direct objects. Consider
phrase marker (7.3) from 1 Kgs 18:25.
(7.3) 


sent cl vb 
union/disj ^ oblq M gram na  you(m) choose 

LI 
^ benf to * pron ; 
" 
x gram prep á ? to 

M Mo 

M 02 you(m) 
\ \ dir obj n:anml n:anml , the 

VN gram app def aaa 

` 

1 \ E 

AA 39 bull 

‘ 4 

` ` 

VAN n:qty 

\ \ def c n the 

` LI 

‘ LI 

Iu INN one 

\ \ z 

‘ \ 

\ rien one m a mm D ce yes jm ela ea oa ee area »^ and 

\ 

y cl vb 

—— 
oblq gram 
TN tm pt 


wy — you(m) make 


MNI first 
In the second clause, it is “the one bull” that Elijah commands the prophets of Baal to “make,” even
though this is not evident from the phrase marker as presently configured.²⁹


27. Robert D. Holmstedt, The Relative Clause in Biblical Hebrew: A Linguistic Analysis (Ph.D. diss., Univ. of
Wisconsin, 2002) 185-86. According to Trask, A Dictionary, 219: “pro-drop . . . The phenomenon in which an argument
position of a verb, particularly subject position, can be left empty.”

28. Sharon A. Cote, Grammatical and Discourse Properties of Null-Arguments in English (Ph.D. diss., University
of Pennsylvania, 1996) 10.


29. In chap. 20, we will see that drawing an edge from the lower cl/oblq up to dir obj/gram encodes this situation 
but violates the Single-Mother Condition (since dir obj / gram will have two mothers), making the phrase marker a graph. 


Perhaps a second example will prove helpful. Consider phrase marker (7.4) for Ps 78:21a.


(7.4) unit 

CN ©. > 12) therefore 

Ni sent | cl vb . 
union/disj ~~ oblq gram — YAY he heard 

«X N 

A sbj i 

N gram > Yahweh 
X M ————— > "n and 
v al vb 


— ayn’ he was angry 


The suffixed Qal active verb שמע is transitive in a little over one-third of its occurrences
(transitive 28 times out of 74 occurrences). When transitive, it is usually translated ‘he heard’; when it
is intransitive, it is often translated ‘he listened’. In Ps 78:21a above, it is (properly) translated as
transitive, even though there is no overt direct object. The antecedent of the non-overt direct object
is the accusation that the people spoke against יהוה in vv. 19 and 20:


DD imr) ANAT 7 3723 NW qp? Ox 59v 
inp? WNW PIOR nn 53» orn 320v? onn 


Since there is extensive use of null anaphora in Biblical Hebrew, Hale’s characteristic (iii),
“extensive use of null anaphora,” is met.

      There is considerable evidence that “languages exhibit different mixtures of morphological
and/or structural marking of functional information, yielding a continuous scale along the
dimension of (non-)configurationality.”³⁰ Since all three of Hale’s characteristics are found, we conclude
that Biblical Hebrew is, to some yet-to-be-determined extent, a nonconfigurational language. Where
Biblical Hebrew lies on the nonconfigurationality gradient is a topic for research.

That Biblical Hebrew exhibits each of the characteristics of nonconfigurational languages has 
implications for the way that we represent and describe the constituents making up clauses. 


7.2.4 Implications of Free Phrase Order 


Given that Biblical Hebrew exhibits some free CIC order, the topic of constituent order is much 
broader than is the case for configurational languages. We take up constituent order in chap. 11. 


7.2.5 Implications of Representing Discontinuity 

      Discontinuity throws up technical challenges when it comes to storing and searching the
resulting phrase markers. These issues are beyond the scope of this book and have, in any case, already
been dealt with by the database implementers. Because the tree conditions listed in §4.2 are
violated, we are required to decide which phenomena we will represent via non-tree phrase markers.
See chap. 20.


30. Anette Frank, “Projecting LFG F-Structure from Chunks – or (Non-)Configurationality from a Different Point
of View,” Proc LFG03 Conference (2003) 217-37; on the web at http://cslipublications.stanford.edu/LFG/8/1fe03.pdf.
See also Carl Pollard and Ivan A. Sag, Information-Based Syntax and Semantics, vol. 1: Fundamentals (Stanford, CA:
CSLI, 1987) 188.


7.2.6 Implications of Null Anaphora


The existence of null anaphora affects both the theory and terminology that we adopt for de- 
scribing clause immediate constituents (CICs), as will be seen in chaps. 9 and 10. 


7.3 Complements and Adjuncts 


We conclude this chapter by turning to the distinction between clausal complements and 
adjuncts. 

7.3.1 The Standard Definitions


      We begin by introducing the distinction between complements and adjuncts. A common
partitioning of CICs is embodied in these definitions from van der Merwe, Naudé, and Kroeze:³¹

Complement . . . this term refers to an obligatory element in a construction. On the syntactic level 

complements refer to obligatory, non-omissible, non-verbal parts of the predicate or verb phrase (VP). 


Adjunct... The term . . . refers to an optional or secondary element in a construction. On the syntactic 
level adjuncts refer to optional, omissible, non-verbal elements in the predicate or verb phrase (VP). 


The definitions provided by Waltke and O'Connor assert a similar contrast:³²


[Complement:] an element in a grammatical construction that completes the predicate, notable types 
being objects and some adverbials. 


[Adjunct:] an optional or less important element in a grammatical construction, notable groups of 
adjuncts being adverbs and some prepositional phrases. 


How these four definitions, plus additional definitions supplied by four linguistics authorities,³³
characterize the complement-adjunct distinction is summarized in the following table.


Authority               Complements     Reference to    Distinction
                        Obligatory?     Predicate?      Made Specific?
van der Merwe et al.
Waltke and O'Connor
Crystal
Matthews
Trask                   Not All         Yes             No
Pollard and Sag         Not All         Yes             Attempted


The consensus is that:


1. Complements are obligatory.
2. The domain of complements and adjuncts is the predicate.
3. The distinction between complements and adjuncts is, in practice, vague.

We now discuss our reservations about each of these assertions.


31. Van der Merwe, Naudé, and Kroeze, Biblical Hebrew Reference Grammar, 355.

32. Waltke and O'Connor, IBHS, 690 and 689. Elsewhere (p. 163), they observe that “a strict delimitation [between
complement and adjunct] is not possible.”

33. Crystal, A Dictionary, 88 and 11. P. H. Matthews, Oxford Concise Dictionary of Linguistics (Oxford: Oxford
University Press, 2005) 63 and 8. Trask, A Dictionary, 51 and 8. Pollard and Sag, Information-Based Syntax, 134-39.


7.3.2 Optional Complements 


The existence of null objects in Biblical Hebrew (§7.2.3.2) renders the obligatoriness assertion
inexact. Optionality has been commented on in more general contexts. For example, in a book
section entitled “Optional Complements,” Pollard and Sag observe that “[c]ertain complements
are associated with semantic roles that are ontologically necessary, so that even when the complement
in question is not overtly expressed, the situation described must be one where some object
plays the role in question.”³⁴ The usual clause used to exemplify ontological necessity is “He ate.”
Something must have been eaten, even if its nature is not explicit in the clause. In Biblical Hebrew,
complements may be optional.


7.3.3 The Domain of Complements and Adjuncts 


In §1.3.3, Item #5 (p. 12), we retired the concept of “verb phrase” but not the concept of
predicate. It is the invariant practice of writers explaining complements and adjuncts to locate their
domain in the predicate portion of the clause. The critical observation for our purposes, however,
is this: in nonconfigurational languages such as Biblical Hebrew, a predicate construction is not
useful in the way that it is for configurational languages. This is because “in [Biblical Hebrew] we
have an example of a language in which sbj and obj are rather similar to the same concept in more
familiar languages, but cannot be distinguished in terms of being part of a VP complement.”³⁵ As
remarked in §7.2, nonconfigurational languages have phrase markers, the top levels of which are
flat structures. In them, it is not helpful to analyze a clause into an NP subject and a VP predicate.
As a result, we assert: for Biblical Hebrew, the domain of complements and adjuncts is the clause.³⁶


7.3.4 Making the Complement-Adjunct Distinction Specific 

Most texts vaguely describe adjuncts as being “secondary to” or “less important than” complements.
They do not provide operational diagnostics for distinguishing complements from adjuncts.
One exception to this nonspecificity is provided by Pollard and Sag.³⁷ They give “a number of
rough-and-ready syntactic and semantic diagnostics which usually serve to make the distinction”
between complements and adjuncts. Here, in their own words, are their diagnostics:


1. Order-dependence of content: The contribution of adjuncts to semantic content can depend
upon their relative order in a way that does not apply to optional complements.³⁸

2. Constancy of semantic contribution: In general, a given adjunct can occur with a relatively
broad range of [verbs] while seeming to make a more-or-less uniform contribution to
semantic content across that range. A given optional complement, by contrast, is typically
limited in its distribution to co-occurrence with a small (and often semantically restricted)
class of [verbs] (possibly even a single item).³⁹

3. Iterability: In general, two or more instances of the same adjunct type can occur with the
same [verb], but this is impossible for complements.⁴⁰

4. Relative order: In English, some adjuncts tend to be ordered after complements.⁴¹

5. Possibility of internal gaps:⁴² At least some adjuncts appear to generally disallow
unbounded internal traces. . . . Complements, by contrast, generally allow internal gaps.⁴³


34. Ibid., 132-34. See also Kordula De Kuthy and W. Detmar Meurers, “Dealing with Optional Complements in
HPSG-Based Grammar Implementations,” Proc. HPSG2003 Conf., 88-96.

35. Y. N. Falk, Lexical-Functional Grammar: An Introduction to Parallel Constraint-Based Syntax (Stanford, CA:
CSLI, 2001) 21. We have replaced “Wambaya” in the quotation with “Biblical Hebrew.”

36. With regard to complements and adjuncts, however, the clause can be usefully (up to a point) subdivided into
a nucleus, a core, and a periphery. See §11.3.2 and §19.2.

37. Carl Pollard and Ivan A. Sag, Information-Based Syntax, 135-39.

38. Idem, adjunct examples: “Kim jogs reluctantly twice a day” versus “Kim jogs twice a day reluctantly.” Optional
complement examples: “Kim complained about the neighbors to the landlord” versus “Kim complained to the landlord
about the neighbors.”


Note that practical application of these diagnostics presupposes that one has reliable methods for
judging “contribution to semantic content” and sufficient data to assess patterns of occurrence “in
general.” While it might be possible to devise methods for judging “contribution to semantic content,”
the amount of data available for Biblical Hebrew is limited. Hence, statistically significant
inferences about complements versus adjuncts are limited. For example, of the 1,573 verb lexemes
in our dictionary for Biblical Hebrew, 1,007 (64%) occur 10 or fewer times. Given so few instances,
reliable inferences will not be possible for nearly two-thirds of the verb lexemes. We will be able to
make robust inferences for the verb lexemes that appear frequently, and this may be sufficient. One
hundred fourteen of the verb lexemes (7.2%) occur more than 100 times. For these, it should be
possible to investigate the complement-adjunct distinction reliably.
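
The proportions quoted in this paragraph can be checked with a few lines of arithmetic. The Python fragment below is only an illustrative sketch: the three counts come from the text, while the constant and function names are assumptions introduced here and are not part of the database described in this book.

```python
# Counts quoted in the text; all names are illustrative assumptions.
TOTAL_VERB_LEXEMES = 1573
RARE_LEXEMES = 1007       # lexemes occurring 10 or fewer times
FREQUENT_LEXEMES = 114    # lexemes occurring more than 100 times


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


rare_pct = share(RARE_LEXEMES, TOTAL_VERB_LEXEMES)
frequent_pct = share(FREQUENT_LEXEMES, TOTAL_VERB_LEXEMES)

# Lexemes that are neither rare nor frequent (11 to 100 occurrences).
middle_band = TOTAL_VERB_LEXEMES - RARE_LEXEMES - FREQUENT_LEXEMES

print(rare_pct, frequent_pct, middle_band)
```

The remaining lexemes, those attested between 11 and 100 times, form a middle band for which the reliability of inference would have to be judged case by case.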


7.4 An Alert regarding “Marginal Analyses" 


The second table of §7.2.1.2 is reproduced below. It was presented above to make the point
that, for unanchored verbs across the totality of Biblical Hebrew, the grammatical function (GF)
sequence VSO is not preferred over the SVO sequence but, rather, the reverse is true:


SVO              VSO              OSV
2,387 (44.8%)    1,611 (30.2%)    166 (3.1%)


SOV              VOS              OVS
143 (2.7%)       662 (12.4%)      364 (6.8%)
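
The percentages in this table follow directly from the six counts. As a hedged illustration (the dictionary and variable names below are assumptions made for this sketch, not identifiers from the underlying database), they can be recomputed as follows:

```python
# Counts of grammatical-function sequences as given in the table.
gf_counts = {
    "SVO": 2387,
    "VSO": 1611,
    "VOS": 662,
    "OVS": 364,
    "OSV": 166,
    "SOV": 143,
}

total_clauses = sum(gf_counts.values())

# Percentage share of each sequence, rounded to one decimal place.
gf_share = {
    sequence: round(100 * count / total_clauses, 1)
    for sequence, count in gf_counts.items()
}

# Order the sequences from most to least frequent.
ranking = sorted(gf_counts, key=gf_counts.get, reverse=True)

print(total_clauses, gf_share, ranking)
```

Sorting the counts makes the ranking explicit: SVO comes first and VSO second, which is the point the table is meant to convey.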


39. Adjunct example: “Kim camps / jogs / meditates on the hill.” Optional complement example: “Kim depends / relies
on Sandy.”

40. Adjunct example: “Kim and Sandy met in Baltimore in the lobby of the Hyatt in July.” Optional complement
negative example: “*Yes we have no bananas, pineapples.”

41. Adjunct example: “Butch apologized to his computational ichthyology seminar because coelocanths were
undecidable.” Optional complement negative example: “*Butch apologized because coelocanths were undecidable to his
computational ichthyology seminar.”

42. Trask, A Dictionary, 114: “gap . . . A location in a sentence in which no element is overtly present even though
some element appears to be in some sense grammatically required.”

43. Adjunct negative example: “*Which endangered species did Sandy meet someone [fond of __]?” Complement
example: “Which endangered species did Kim impress you as being most [fond of __]?”


The examination of the GF sequence might have gone further. We might have searched out the
factors that affect the sequences observed. To dig deeper into the GF-ordering puzzle, we might next
have inquired whether text type had any effect on GF ordering. We might have tallied, as a function
of gross text type, the incidence of clauses found to be subject-initial, predicator-initial, or
object-initial. This would have led to a table of the sort shown below.


Gross Text Type          Subject-Initial   Predicator-Initial   Object-Initial   Row Total
Narrative                      367                386                 104            857
Human-to-human speech          812                583                 165          1,560
Human-to-God speech            312                200                  49            561
Divine speech                  768                512                 123          1,403
Column Total                 2,259              1,681                 441          4,381


We would observe, for narrative, that initial-subject clauses are slightly outnumbered by
initial-predicator clauses but that, for the other three text types, the initial-subject clauses
outnumber the initial-predicator clauses by significant amounts.⁴⁴ Further exploration would be
called for to determine why this should be.
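
A contingency-table analysis of this kind can be sketched in a few lines. The Python fragment below is an illustration under stated assumptions, not the analysis actually run for this book: it uses only the counts in the table, computes expected counts under independence, the chi-squared statistic, and simple Pearson residuals (observed minus expected, divided by the square root of expected). The variable names are introduced here for illustration, and the critical value is taken from standard chi-squared tables.

```python
import math

COLUMNS = ("subject-initial", "predicator-initial", "object-initial")

# Observed clause counts by gross text type, copied from the table.
observed = {
    "Narrative": (367, 386, 104),
    "Human-to-human speech": (812, 583, 165),
    "Human-to-God speech": (312, 200, 49),
    "Divine speech": (768, 512, 123),
}

row_totals = {row: sum(cells) for row, cells in observed.items()}
col_totals = [sum(cells[j] for cells in observed.values())
              for j in range(len(COLUMNS))]
grand_total = sum(row_totals.values())

chi_squared = 0.0
pearson_residuals = {}
for row, cells in observed.items():
    residuals = []
    for j, count in enumerate(cells):
        # Expected count if text type and initial constituent were independent.
        expected = row_totals[row] * col_totals[j] / grand_total
        residuals.append((count - expected) / math.sqrt(expected))
        chi_squared += (count - expected) ** 2 / expected
    pearson_residuals[row] = residuals

degrees_of_freedom = (len(observed) - 1) * (len(COLUMNS) - 1)

# Critical value for 6 degrees of freedom at the 0.01 level (standard tables).
CRITICAL_VALUE_DF6_P01 = 16.812

print(f"chi-squared = {chi_squared:.1f}, df = {degrees_of_freedom}")
for row, residuals in pearson_residuals.items():
    print(row, [f"{r:+.2f}" for r in residuals])
```

The sign of each residual shows the direction of the departure from independence: a negative value means that fewer clauses were observed than independence would predict, a positive value that more were observed.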

Usually in this book, we reduce the complexity of the situations being described in order to 
simplify exposition. This is standard practice when one is doing “exploratory data analysis,” as we 
are in general. To probe our data fully would have unduly expanded this volume and delayed its
publication. Readers are urged to keep this author-imposed limitation in mind and encouraged to
view it as an invitation to further research.


7.5 Brief Summary 


Our Definition of "Clause." Our definition of clause stands up well when compared with 
those found in grammars of Biblical Hebrew and in reference works on general linguistics. 

On Nonconfigurationality. Biblical Hebrew meets all three of Hale’s characteristics of
nonconfigurational languages: free phrase order, use of discontinuous constituents, and frequent use of
null anaphora. Free phrase order is seen across the corpus, especially when unanchored predicators
are involved. Discontinuity occurs about 10% of the time in constituents where it is admissible.
And Biblical Hebrew, being a pro-drop language, exhibits frequent null anaphora.

Complements and Adjuncts. The distinction between complements and adjuncts is vague but 
useful if one allows complements to be optional, as we do—contrary to the usual definition. Ways 
of making the distinction semirigorous have been proposed but have not yet been successful. 

Marginal Analyses. We are all too well aware that some readers will be frustrated by our not
examining the influences of potentially relevant explanatory variables on the phenomena described
in this volume.


44. Technical note: This impression is validated by contingency table analysis. For narrative, the standardized
residuals for initial-subject clauses are significantly negative, while the residuals for the initial-predicator clauses are
significantly positive (chi-squared test with 6 degrees of freedom and at the 99% confidence level).


Chapter 8 


Embedded Clauses 


In chap. 7, we focused on main clauses. Here, we take up four kinds of embedded clause:


1. nominalized clauses (with and without אֲשֶׁר),
2. clausal complements,
3. subordinated clauses,
4. clause-like structures.


8.1 Nominalized Clauses 


A few of the phrase markers shown thus far contain embedded nominalized clause constitu- 
ents, identified by their telltale dominating nom/nom nodes. See phrase markers (3.1), (3.4), (3.5), 
(3.10), (6.1), (6.6), (6.33), and (6.34). 


8.1.1 The Representation of Nominalized Clauses

In Biblical Hebrew, we find 6,873 nominalized clauses.¹ They divide into two groups:


Group #1: clauses preceded by an overt nominalizer, usually אֲשֶׁר,²
Group #2: clauses not preceded by an overt nominalizer.³


The constituent in (8.1) from Gen 3:1 shows a Group #1 nominalized clause (hence, marked by a
preceding אֲשֶׁר). Note that our practice is to place the אֲשֶׁר outside the embedded clause. We view
אֲשֶׁר as an operator⁴ that nominalizes a following clause or sentence.⁵


1. We studied simple nominalizer incidence in Francis I. Andersen and A. Dean Forbes, “ ‘Prose Particle’ Counts in 
the Hebrew Bible,” in The Word of the Lord Shall Go Forth: Essays in Honor of David Noel Freedman in Celebration 
of His Sixtieth Birthday (ed. Carol L. Meyers and M. O'Connor; Winona Lake, IN: Eisenbrauns, 1983) 165-83. 

2. R. D. Holmstedt, The Relative Clause in Biblical Hebrew (Ph.D. diss., University of Wisconsin, 2002) addresses
clauses introduced by segments other than אֲשֶׁר (e.g., זֶה, זוּ, זוֹ, הַ־, and שֶׁ־), a topic that we do not take up except for הַ־
as a nominalizer (§6.3.2.2). These alternate nominalizers are marked and hence searchable but are not discussed by us
(see §3.2.1.1).

3. This would be referred to as the “null relative word case" by Holmstedt, The Relative Clause, v. 

4. R. L. Trask, A Dictionary of Grammatical Terms (London: Routledge, 1993) 195: "Any grammatical element 
which bears a scope relation to some part of a sentence." 

5. A nominalizer may scope more than a clause. In phrase marker (6.1), the nominalizer scopes a sentence.


(8.1) [Phrase marker for Gen 3:1. Glosses, in order: ‘all of’, ‘wild animal of’, ‘the’, ‘field’, ‘which’, ‘he made’,
‘Yahweh’, ‘God’.]

The constituent in (8.2) from Ps 118:24 contains a nominalized clause from Group #2. It lacks an
אֲשֶׁר.


(8.2) [Phrase marker for Ps 118:24. Glosses, in order: ‘the’, ‘day’, ‘he made’, ‘Yahweh’.]


8.1.2 The Distribution of Nominalized Clauses 


The distribution of Group #1 (“Overt Nominalizer”) and Group #2 (“Null Nominalizer”) clauses
across Biblical Hebrew can be presented graphically. To do so, we first gather the data shown in
the following table. (“Section Size” is the total number of clauses of any type found in the section.)


                     Overt          Null           Incidence     Section
                     Nominalizer    Nominalizer    Counts        Size
Torah                                                            19,593
Former Prophets
Latter Prophets
Poetry                                                           11,462
Other Writings                                                   10,215
Total                                                            78,775


We then compensate for the effects of section size by dividing the overall incidence of nominalized
clauses in a section by a measure of the section's size (“Relative Incidence”). We let the number of
clauses in each section of the MT measure its size.⁶ The effects of section size may also be removed
by forming the ratio of overt nominalizer to null nominalizer counts (“Overt-to-Null Ratio”). The
following pair of indicators results.


                     Relative       Overt-to-Null
                     Incidence      Ratio
Torah                0.11           17.81
Former Prophets      0.10           18.19
Latter Prophets      0.07            4.66
Poetry               0.03            1.17
Other Writings       0.10            8.93
Overall              0.09            8.15


6. That is, the relative incidence measures the frequency of nominalized clauses as a proportion of the total number
of clauses in the section.
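
The two indicators are simple quotients, and a short sketch may make their definitions concrete. The Python fragment below is illustrative only: the function names are assumptions introduced here, the corpus-wide totals (6,873 nominalized clauses among 78,775 clauses) are the ones quoted in the text, and the per-section counts used in the second half are hypothetical values chosen merely to exercise the functions.

```python
def relative_incidence(nominalized: int, section_size: int) -> float:
    """Nominalized clauses as a proportion of all clauses in a section."""
    return nominalized / section_size


def overt_to_null_ratio(overt: int, null: int) -> float:
    """Number of overt-nominalizer clauses per null-nominalizer clause."""
    return overt / null


# Corpus-wide totals quoted in the text.
overall_incidence = relative_incidence(6873, 78775)

# Hypothetical counts for a single section (NOT taken from the book's table).
demo_overt, demo_null, demo_size = 900, 100, 10_000
demo_incidence = relative_incidence(demo_overt + demo_null, demo_size)
demo_ratio = overt_to_null_ratio(demo_overt, demo_null)

print(round(overall_incidence, 2), demo_incidence, demo_ratio)
```

Because both indicators are ratios, neither depends on the absolute size of a section, which is what allows sections of very different lengths to be compared.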


To grasp the significance of these measures, consider the situation observed for the Torah. There we
find that roughly one in 10 clauses is a nominalized clause. We also find that about 18 times as many
nominalized clauses have an overt nominalizer as lack one. Figures 8.1 and 8.2 make the
comparative situation across Biblical Hebrew more immediately obvious.


Fig. 8.1. Relative Incidence of Nominalized Clauses.

Fig. 8.2. Overt-to-Null Nominalizer Ratio.


In the Poetry section (Psalms, Job, and Proverbs):


1. Embedded nominalized clauses occur far less frequently in the poetry section than they
do elsewhere (left plot). For example, they occur almost four times less frequently in the
poetry section than in the Torah section.

2. Relative to null-nominalizer clauses, overt-nominalizer clauses are far less frequent than
in other sections of Biblical Hebrew (right plot). For example, the overt-to-null ratio is
more than 15 times smaller in the poetry section than in the Torah section.


8.1.3 Other Characteristics of Nominalized Clauses

Taking our lead from Holmstedt, we consider these four topics related to nominalized clauses:⁷


1. overtly and covertly headed nominalized clauses (“dependent” versus “independent”),

2. restrictive and nonrestrictive nominalized clauses (“essential” versus “non-essential”),

3. the resumption phenomenon wherein a pronoun in a nominalized clause refers back to the
substantive that the nominalized clause apposes,

4. nominalized clause extraposition wherein the nominalized clause does not immediately
follow the substantive that it apposes.


Dealing with these phenomena in detail is beyond the scope of this volume, but we will provide
phrase markers illustrating each. Readers desiring comprehensive information should consult a
standard grammar or, better, Holmstedt's thesis.

8.1.3.1 Overtly and Covertly Headed Nominalized Clauses⁸

In phrase marker (8.2) from Ps 118:24 (above), the nominalized clause is in apposition with an
immediately preceding overt head, namely, הַיּוֹם ‘the day’. The nominalized clause is dependent.
By contrast, phrase marker fragment (8.3) from Gen 18:19 provides an example of an independent
nominalized clause. It stands in apposition to no preceding substantive.


(8.3) [Phrase marker fragment for Gen 18:19. Glosses, in order: [nota acc.], ‘which’, ‘he spoke’, ‘upon’ with a
pronoun suffix.]


8.1.3.2 Restrictive and Nonrestrictive Nominalized Clauses⁹

Holmstedt contrasts restrictive and nonrestrictive clauses as follows:

[A] restrictive [nominalized clause] provides information about its head which is necessary for
identifying the exact referent of the head; a non-restrictive [nominalized clause] presents additional
information about its head that is non-crucial for identifying the referent of the head.¹⁰


As instances of the phenomenon of restrictiveness, we use two examples that are discussed by
Holmstedt.¹¹ As an instance of a restrictive nominalized clause, he supplies Gen 1:7. The relevant
phrase marker fragment shown in (8.4) has an overt head and an overt nominalizer.


(8.4) [Phrase marker fragment for Gen 1:7. Glosses, in order: ‘the’, ‘water’, ‘which’, ‘from’, ‘underneath’, ‘to’,
‘the’, ‘expanse’.]


7. Holmstedt, The Relative Clause, passim. We uniformly replace Holmstedt’s “relative clause” with “nominalized
clause.”

8. Ibid., 66-79.

9. Ibid., 114-25.

10. Ibid., 114.

11. Ibid., 115.


The nominalized clause clearly restricts the reference of הַמַּיִם ‘the waters’. As a nonrestrictive
example, consider (8.5) from Exod 18:9, which has an overt head and an overt nominalizer. The
nominalized clause is non-restrictive, since יִשְׂרָאֵל is a proper noun and, as such, is not eligible for
restriction.


(8.5) [Phrase marker fragment for Exod 18:9. Glosses, in order: ‘Israel’, ‘which’, ‘he delivered’, ‘his/him’, ‘from’,
‘hand of’, ‘Egypt’.]


8.1.3.3 Resumption Phenomena 

As noted at the outset of §8.1.3, the resumptive phenomenon¹² involves a pronoun in a nominalized
clause that refers back to the substantive to which the nominalized clause is in apposition.
Consider phrase marker (8.6) from Isa 37:34 (p. 103). Here, the nominalized clause is in apposition
to הַדֶּרֶךְ ‘the path’. But ־הּ ‘her’ is the anaphor of ‘the path’. Thus, by the definition, this
construction is “resumptive” (it also has an overt head, an overt nominalizer, and is restrictive).


12. Ibid., 90-107.


(8.6) [Phrase marker for Isa 37:34. Glosses, in order: ‘in’, ‘the’, ‘path’, ‘which’, ‘in’, ‘her’; the embedded clause
also contains a verb (vb) node.]


8.1.3.4 Nominalized Clause Extraposition¹³

According to Crystal,¹⁴ extraposition “refers to the process or result of moving (or extraposing)
an element from its normal position to a position at or near the end of a sentence.” Holmstedt gives
the locations of 139 instances of extraposed nominalized clauses.¹⁵ In 63 of his cases, we do not
find an extraposed nominalized clause. These divergences result from differing analyses, especially
our allowing phrases to be heads for nominalized clauses. Consider phrase marker (8.7) from Gen
1:11. Our analysis makes the phrase פְּרִי לְמִינוֹ the head for the nominalized clause. As a result, the
(restrictive) nominalized clause (nom / nom) is not extraposed from its head.


(8.7) [Phrase marker for Gen 1:11. Glosses, in order: ‘fruit’, ‘to’, ‘kind’, ‘his/him’, ‘which’, ‘seed’, ‘his/him’,
‘in’, ‘his/him’.]


We find 94 instances of extraposed nominalized clauses that are not in Holmstedt's list. Each
involves distributed apposition (introduced toward the end of §7.2.2). Phrase marker fragment (8.8)
from 1 Chr 22:13 shows the phenomenon. Both הַחֻקִּים and הַמִּשְׁפָּטִים are in distributed apposition to
the following restrictive nominalized clause. Because of the presence of וְאֶת־הַמִּשְׁפָּטִים, the
nominalized clause is not adjacent to הַחֻקִּים. It is inherently extraposed.


(8.8) [Phrase marker fragment for 1 Chr 22:13. Glosses, in order: [nota acc.], ‘the’, ‘rules’, ‘and’, [nota acc.],
‘the’, ‘judgments’, ‘which’, ‘he commanded’, ‘Yahweh’, [nota acc.], ‘Moses’, ‘upon’, ‘Israel’.]


13. Ibid., 290-93, 301-5.

14. David Crystal, A Dictionary of Linguistics and Phonetics (5th ed.; Oxford: Blackwell, 2003) 174. Emphases
removed.

15. Holmstedt, The Relative Clause, 301, footnote.


8.2 Clausal Complements 


A second kind of embedded clause is the clausal complement. When a cognitive/mental verb
is involved, its clause is commonly said to encode “indirect speech.”¹⁶ We have identified 872 of
these embedded clauses in Biblical Hebrew. They are of three sorts. We classify them as:

1. clausal cognitive complements of cognitive/mental verbs (805x),
2. clausal complements in interrogative expressions (54x),
3. paradoxical clausal complements in verbless clauses (13x).


8.2.1 Clausal Cognitive Complements of Cognitive/Mental Verbs

The repertoire of cognitive/mental verbs encountered is fairly small. Most often seen in these
constructions are verbs having the roots זכר ‘remember’, ידע ‘know’, ראה ‘see’, and שׁמע ‘hear’,
but others put in appearances, even היה ‘be’, once.¹⁷ A CIC that is the complement of a
cognitive/mental verb is usually labeled cogv cmpl/cue (“cognitive complement”). Here, the licensing
relation is cue, which indicates that the CIC includes a cue phrase, that is, a segment that signals
the onset of an embedded discourse unit.¹⁸ The cue phrase used with complements of cognitive/mental
verbs is almost always the complementizer כִּי ‘that’. Phrase marker (8.9) provides an example, taken
from Jer 16:21, of a clausal cognitive complement of the cognitive/mental verb יָדְעוּ ‘they will know’:


16. Trask, A Dictionary, 140: “The reporting of what someone else has said without using his/her exact words." 

17. Since we work with the Kethiv text, we treat Judg 16:25 as involving a clausal complement. 

18. Also termed a “discourse marker" or “discourse particle." See Kerstin Fischer, ed., Approaches to Discourse 
Particles (Bingley, UK: JAI, 2008).


(8.9) [Phrase marker for Jer 16:21. Glosses, in order: ‘and’, ‘they will know’, ‘that’, ‘name’ with a pronoun
suffix, ‘Yahweh’.]


Here, the embedded clause is verbless (verbless clauses are taken up in chap. 19). 


There are compound complements 16 times.¹⁹ In these cases, the complementizer (כִּי) is repeated
in a coordinated structure dominated by a cogv cmpl/union/disj node. Phrase marker (8.10)
is an example from Gen 29:12. The embedded structures are usually simple clauses but are
sentences 112 times.


(8.10) [Phrase marker for Gen 29:12. Glosses, in order: ‘and’, ‘he reported’, ‘Jacob’, ‘to’, ‘Rahel’, ‘that’,
‘brother of’, ‘father’, ‘her’, ‘he’, ‘and’, ‘that’, ‘son of’, ‘Ribqah’, ‘he’. The two cogv cmpl/cue constituents are
joined under a cogv cmpl/union/disj node.]


19. These may be found in Gen 3:6, 29:12; Exod 3:11, 4:31; Josh 2:9, 8:21, 10:1; 1 Sam 31:7; 2 Sam 5:12, 16:10;
1 Kgs 11:21; Jer 40:7, 40:11; Job 7:17, 31:25; and 1 Chr 10:7.


8.2.2 Clausal Complements in Interrogative Expressions

We have classified 54 interrogative expressions as involving clausal complements. The basic
structure is as seen in phrase marker (8.11), a verbless clause from Job 21:15:


(8.11) [Phrase marker for Job 21:15. Glosses, in order: ‘Shadday’, ‘that’, ‘we will serve’, ‘his/him’.]


8.2.3 Clausal Complements in Verbless Clauses

We have classified 13 verbless clauses as involving clausal complements.²⁰ The structure is as
seen in phrase marker (8.12) from Ps 119:71:


(8.12) [Phrase marker for Ps 119:71. Glosses, in order: ‘good’, ‘to’, ‘me’, ‘that’, ‘I was humbled’. The clausal
complement is dominated by a cogv cmpl/pdox node.]


That the CIC dominating this clausal complement is cogv cmpl/pdox is an abuse of terminology 
again occasioned by our desire to avoid undue proliferation of CIC categories. Just how the con- 
stituents combine, and to what effect, is unclear to us. Hence our introduction of the paradoxical 
licensing relation, pdox. The parse shown here could also be used for situations similar to (8.11). 
Note that the phrase marker exhibits tangling and therefore is not a tree (see chap. 20). 


8.3 Adverbial Subordinated Clauses 


In §8.1, we dealt with one sort of subordinated clause, the nominalized (or relative) clause,
which usually is in apposition to a noun phrase. Then in §8.2, we dealt with another sort of
subordinated clause, the verb-complement clause, which is usually the complement of a cognitive/mental
verb. We turn now to the third kind of subordinated clause, the adverbial clause:


A subordinate clause which bears to its main clause any of a range of semantic relations similar to
those borne by adverbs, such as time, manner, place, instrument, circumstance, concession, purpose,
result, cause or condition.²¹


20. See Exod 3:12, 5:2; Josh 22:34, 24:22; 1 Sam 12:5, 21:16; 2 Kgs 20:9; Jer 44:29; Ps 119:71; Prov 22:8; Ruth
4:9; Qoh 9:3; Lam 3:27.

21. Trask, A Dictionary, 10.


Readers who have access to our data may have noticed that the syntax search interface for CICs
contains this set of eight “mixed-level constituents” (five of which appear as instances in Trask’s
definition of the adverbial clause above):


Aim / Purpose| Condition 


Concessive | Discourse-Level Sequential-ו²²


| Reason | Time Point 


Undesired Outcome 


These CICs are called mixed-level because they can occur within clauses as phrasal CICs or as 
supra-clausal elementary discourse units (EDUs). Consider the reason CIC (rsn/gram) in phrase 
marker (8.13) from 2 Chr 29:9. The final CIC is structurally a prepositional phrase. Functionally, it 
provides the reason for the state of affairs. The clause is translated by the NJPS as ‘and our sons and
daughters and wives are in captivity on account of this’.


(8.13) [Phrase marker for 2 Chr 29:9. Glosses, in order: ‘sons’, ‘us’, ‘and’, ‘daughters’, ‘us’, ‘and’, ‘women’,
‘us’, ‘in’, ‘the’, ‘captivity’, ‘upon’, ‘this’. The final CIC is labeled rsn/gram.]


Compare phrase marker (8.14) from Gen 43:30: 


22. Strictly speaking, this is an interim mispositioned cue phrase rather than a true CIC.


(8.14) [Phrase marker for Gen 43:30. Glosses, in order: ‘and’, ‘he hurried’, ‘Yosep’, ‘because’, ‘they grew hot’,
‘compassion’, ‘his/him’, ‘unto’, ‘brother’, ‘his/him’.]


Here, we have represented the reason (rsn/cue) as a supra-clausal elementary discourse unit 
(EDU). We discuss the supra-clausal domain in chap. 21. For now, we simply note that the main 
clause with its subordinated adverbial clause has been analyzed into two elementary discourse 
units. The first is simply called a “unit” (unit/cue), our way of indicating that it has the status of 
an EDU in discourse but that we have not yet decided its precise EDU class. The second unit is 
truly a reason EDU (rsn/cue) and is marked as such by the cue phrase כִּי, glossed ‘because’. The
two EDUs combine to form a discourse unit represented at this tentative stage of our analysis by 
the node having the nondescript label sent/discrs (sentence licensed in discourse). In a perhaps 
jarring departure from convention, we have promoted the subordinated clause and its subordinating 
conjunction to discourse level in preparation for proper handling by discourse analysis. 


8.4 Embedded Clause-Like Types 


Above, we have discussed traditional embedded clauses. In addition to those, we have already
seen two other kinds of construction involving embedding that exhibit clause-like behavior:
nominalized participles and nominalized infinitives construct. See the following phrase markers:


Clause-Like Type                 Locations of Phrase Markers

Noun-Verb Participle             (3.4) 1 Sam 8:1     (6.7) Isa 62:6     (6.33) Prov 21:24
                                 (6.8) Jer 22:16     (6.12) Gen 2:5     (7.1) Ruth 1:1

Noun-[Verb/Noun] Participle      (3.6) Hos 14:8

Infinitive Construct             (3.2) Gen 37:25


In each case, the participle or infinitive construct behaves as a noun “up-front” and as a verb
“out-back” (put technically, it behaves exocentrically as a noun and endocentrically as a verb). In the
next two subsections, we discuss the verbal nouns and show a few interesting phrase markers.


8.4.1 Noun-Verb Participles 

Based on our classification, there are 2,413 constructions containing noun-verb and
noun-[verb/noun] participles in Biblical Hebrew.²³ Of these, 29 participles are parts of non-tree phrase
markers, 27 having two mothers and two having three mothers (in Ps 106:21 and Ps 136:7). Consider
phrase marker (8.15) for the three-mother instance in Ps 106:21-22.
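
What it means for a node to have more than one mother can be made concrete with a small sketch. The Python fragment below is illustrative only: the edge-list encoding, the function names, and the toy node labels are assumptions introduced here and do not reproduce the format of the underlying database. It counts the mothers of each node and reports the nodes that violate the single-mother condition on trees.

```python
from collections import defaultdict


def count_mothers(edges):
    """Return {daughter: number of distinct mothers} for (mother, daughter) pairs."""
    mothers = defaultdict(set)
    for mother, daughter in edges:
        mothers[daughter].add(mother)
    return {node: len(found) for node, found in mothers.items()}


def multi_mother_nodes(edges):
    """Return only the nodes that have more than one mother."""
    return {node: n for node, n in count_mothers(edges).items() if n > 1}


# Toy edge list loosely modeled on a three-mother case; labels are hypothetical.
toy_edges = [
    ("clause", "verb"),
    ("clause", "direct object"),
    ("direct object", "participle phrase 1"),
    ("direct object", "participle phrase 2"),
    ("direct object", "participle phrase 3"),
    ("participle phrase 1", "participle"),
    ("participle phrase 2", "participle"),
    ("participle phrase 3", "participle"),
    ("participle phrase 1", "object 1"),
    ("participle phrase 2", "object 2"),
    ("participle phrase 3", "object 3"),
]

shared = multi_mother_nodes(toy_edges)
satisfies_single_mother = not shared  # False when any node has several mothers

print(shared, satisfies_single_mother)
```

In this toy structure a single participle node is shared by three participial phrases, so the structure is a graph rather than a tree, which is the situation that the non-tree phrase markers are designed to represent.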

We make four generalizations about participial constructions:


1. The participles almost always have a single parent.
2. The participles are almost always the first CIC in their construction.²⁴
3. The subject of the participial construction is almost always null (“covert”).²⁵
4. The most common participial construction has two CICs.²⁶


(8.15) [Phrase marker for Ps 106:21-22. Glosses, in order: ‘they forgot’, ‘God’, ‘saver’, ‘them (m)’, ‘making’,
‘big ones’, ‘in’, ‘Egypt’, ‘wonders’, ‘in’, ‘land of’, ‘Ham’, ‘frightful’, ‘upon’, ‘Reed Sea’. The participle
‘making’ is shared by three nom ptc constituents.]


23. The census is as follows: 2,383 have 1 parent; 27 have 2 parents distributed as follows: 2 in Deuteronomy, 11 in 
Isaiah, 3 in Jeremiah, 4 in the Minor Prophets, 6 in Psalms, and 1 in Job. Two have 3 parents as noted in the text above. 

24. After all, the nominal front of the verbal noun must plug in to something contiguous. The participle comes first 
2,401 times out of 2,413 instances (99.5%). The participle is preceded by another CIC 12 times: location (4x), direct 
object (3x), negative (2x), subject (1x), subject complement (1x), and instrument (1x). See §11.1.3.3. 

25. We find 4 distributive subjects (1 Sam 25:10, Jer 23:30, Ezek 3:13, and 2 Chr 18:9) and 16 subjects. One might 
argue that, in general, the identity of the subject inheres in the semantics of the nominal aspects of the participles. 

26. There are 1,869 two-CIC participial constructions out of 2,454 (76%). 
This is how many CICs the noun-verb and noun-[verb/noun] participle constructions have:


In almost 80% of the two-CIC instances, the participle is followed by either a direct object CIC 
(x705, 37.8%), a location CIC (x568, 30.5%), a movement origin CIC (x106, 5.7%), or an instru- 
ment CIC (x90, 4.8%). Of the 705 direct object CICs, 69 (10%) are pronoun suffixes on the parti- 
ciple and 28 (4%) appear as pronoun suffixes on object markers, leaving 608 (86%) as other free- 
standing phrases. Of the 564 location CICs, 509 (90%) are prepositional phrases. The preposition 
is 3 318 times (62%) and d} 162 times (32%). 
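
The percentages in this subsection follow from the raw counts. As a quick check, the following Python sketch recomputes a few of them from the counts quoted in the text and in note 24. It is offered only as an illustration, not as part of the original analysis, and the helper name `pct` is hypothetical.

```python
def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Counts quoted in the text and in note 24 (nothing here is new data).
participle_first = pct(2401, 2413)         # participle is the first CIC
suffix_on_participle = pct(69, 705)        # direct object as suffix on the participle
suffix_on_object_marker = pct(28, 705)     # direct object as suffix on an object marker
free_standing_phrase = pct(608, 705)       # direct object as another free-standing phrase
prepositional_location = pct(509, 564)     # location CICs that are prepositional phrases

# The three kinds of direct object CIC account for all 705 instances.
assert 69 + 28 + 608 == 705

print(participle_first, suffix_on_participle, suffix_on_object_marker,
      free_standing_phrase, prepositional_location)
```

Rounded to whole numbers, these values agree with the figures reported above (99.5%, 10%, 4%, 86%, and 90%).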


8.4.2 Nominalized Infinitives Construct 

Based on our classification, there are 6,682 nominalized infinitive construct constructions in
Biblical Hebrew, none having more than one mother. Phrase marker (8.16) from Judg 16:23 is
typical.


(8.16) [Phrase marker for Judg 16:23; the tree layout was lost in extraction. Surviving glosses, in order: to / slaughter / sacrifice / big / to / Dagon / gods / them(m).]


We make four generalizations regarding infinitive construct constructions: 


27. See Num 3:38, Deut 8:16 (2x), Isa 49:5, Hab 2:9, Prov 27:14, and 1 Chr 11:10. 
28. See Jer 2:6, which exhibits the CIC order: n-v ptc, dir obj, loc, loc, loc. 
29. See Deut 1:33, which exhibits the CIC order: n-v ptc, loc, loc, aim, loc, tm pt, aim. 
1. The infinitive construct always has a single parent.

2. The infinitive construct almost always is the first CIC in its construction.³⁰

3. The subject of the construction is typically null (74% of the time).³¹

4. The most common infinitive construct construction has two CICs.³²


The infinitive construct preceded by a negative CIC in (8.17) from Gen 3:11 may be of interest.³³


(8.17) [Phrase marker for Gen 3:11; the tree layout was lost in extraction. Surviving glosses, in order: to / eat / from.]


This table shows how many CICs the infinitive construct clause-like constructions have in Biblical 
Hebrew: 


CIC Count | Incidence 


With regard to the “pre-contexts” of the infinitives construct, 6,000 are preceded by prepositions 
(90% of the infinitives construct). The preposition is some form of -7 4,520 times (75% of the pre- 
posed prepositions). 


30. The infinitives construct are initial 6,547 times out of 6,682 instances (98%). They are preceded by a negative 
CIC 97 times. 

31. We find 1,736 subjects and four distributive subjects (Num 32:18; Jer 34:9; Esth 9:19, 9:22). Noted in passing: 
only two subjects come before their infinitives construct, those in Isa 30:7 and in Zech 8:9. 

32. There are 3,805 two-CIC infinitive construct constructions out of 6,682 (57%). 

33. On negation of the infinitive construct, see: C. H. J. van der Merwe, J. A. Naudé, and J. H. Kroeze, A Biblical 
Hebrew Reference Grammar (Sheffield: Sheffield Academic Press, 1999) 153; P. Joüon and T. Muraoka, A Grammar of 
Biblical Hebrew (rev. ed.; Rome: Pontifical Biblical Institute, 2002) 402. 

34. These are at Exod 16:3, 16:8; Lev 7:36; Deut 4:34; 1 Kgs 11:36; Jer 45:1; Dan 11:1; Neh 10:35; and 1 Chr 
16:40. 

35. See Lev 23:37-38, which exhibits the CIC order: inf cst, obj cmp, ind obj, dir obj, obj cmp, tm pt, accomp. 

36. See 1 Chr 23:31, which exhibits the CIC order: inf cst, dir obj, ind obj, tm aim, # times, cmpr, mnr, loc. 
8.5 Brief Summary


Embedded Clauses. We distinguish four kinds of embedded clause: nominalized clauses, 
clausal complements, subordinated clauses, and clause-like structures. 

Nominalized Clauses. Both the overall incidence of nominalized clauses and their use of an 
explicit nominalizer are very significantly depressed in the poetry books and, to a lesser extent, 
in the Latter Prophets. Following Holmstedt, we provide examples of several interesting kinds of 
nominalized clause: overtly and covertly headed, restrictive and nonrestrictive, resumptive, and 
extraposed (where our analysis differs significantly from Holmstedt’s). 

Clausal Complements. In our analysis, clausal complements appear in three contexts: as
cognitive complements of cognitive/mental verbs, in interrogative expressions, and in verbless clauses.
Clausal complements are usually marked by the complementizer כִּי ‘that’.

Adverbial Subordinated Clauses. Adverbial subordinated clauses are introduced here for
completeness but discussed in §21.3.

Embedded Clause-Like Structures. These are mostly noun-verb participles and infinitives
construct. The noun-verb participle structures are usually two (1,861x) or three CICs (452x) long. They
mainly involve: a direct object (38%), a location (30%), or a movement origin (6%). The
nominalized infinitive construct structures are usually two (3,805x) or three CICs (1,718x) long.
Classifying Clause Immediate Constituents


We begin this chapter with a reminder of some concepts introduced at the end of chap. 1. In 
§1.3.3.3, we discussed our first full-fledged phrase marker (from 2 Kgs 18:8, reproduced here). 


(1.5) [Phrase marker for 2 Kgs 18:8; the tree layout was lost in extraction. Surviving CIC labels and glosses, in order: sbj: he / vb: he hit / dir obj: [nota acc] Philistines / mvt aim: until Gazah.]


Specifically, we touched on the ubiquitous licensing relations gram and oblq: 


[W]hen we say that the assigning of some label to a clause immediate constituent (CIC) is licensed by
grammar [gram] or that the assembling of a clause is licensed by obliqueness [oblq], the underlying 
grammatical mechanisms are unspecified. . .. The relations grammar and obliqueness stand for the 
very disparate, and still obscure, mechanisms that provide the grounds for classifying CICs and as- 
sembling them into clauses. . . . Here is the critical point for us: 
Gathering the data needed to begin working out the grammatical mechanisms underlying "gram- 
mar" and “obliqueness” is the focus of the latter two-thirds of this book. 


The point has arrived to begin making good on this last assertion. To this end: 


1. First, we consider the nature of the grammar licensing relation with regard to the taxonomy 
of CICs, their classification, and their recognition. 


* In this chapter (9), we define five major subtypes of CICs and then discuss all but the
second part of the fifth subtype (semantic roles) at length.
* In chap. 10, we take up semantic roles in detail.


2. Second, we gather data that shed light on the nature of the obliqueness licensing relation 
with regard to the incidence, ordering, and makeup of the CICs in various kinds of clauses. 


* In chap. 11, we set out the methods and structures that we use to analyze specific verb
root corpora. 




* In chaps. 12-15, we study the composition, incidence, and ordering of CICs in the 
corpora defined by four very frequent verb roots in Biblical Hebrew. 
* In chap. 16, we examine the structures of the various CIC subtypes. 


9.1 The Clause-Immediate-Constituent Subtype Taxonomy 


It is very helpful to view the clause immediate constituents (CICs) in terms of the following 
simple taxonomy: 


CIC Subtypes

Impermanents | Syntactic Isolates | Predicators | Operators | Grammatical Functions and Semantic Roles


Each CIC falls into one of the five exhaustive and mutually exclusive subtypes. 
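
For readers who think in terms of data structures, the taxonomy can be sketched as an enumeration plus a lookup table. The sketch below is ours and purely illustrative: the names `CICSubtype`, `LABEL_TO_SUBTYPE`, and `subtype_of` are hypothetical, and the table covers only a handful of labels that appear in phrase markers in this chapter (the spelling `lbl` for the label CIC is an assumption). Because a dictionary key maps to exactly one value, mutual exclusivity holds by construction.

```python
from enum import Enum


class CICSubtype(Enum):
    """The five CIC subtypes of the taxonomy."""
    IMPERMANENT = "impermanent"
    SYNTACTIC_ISOLATE = "syntactic isolate"
    PREDICATOR = "predicator"
    OPERATOR = "operator"
    GF_OR_SR = "grammatical function or semantic role"


# Hypothetical, partial assignment of phrase-marker labels to subtypes.
LABEL_TO_SUBTYPE = {
    "dl seq w": CICSubtype.IMPERMANENT,     # underspecified "and" cue phrase
    "neb": CICSubtype.IMPERMANENT,          # nebulous constituent
    "voctv": CICSubtype.SYNTACTIC_ISOLATE,  # vocative
    "excl": CICSubtype.SYNTACTIC_ISOLATE,   # exclamative
    "lbl": CICSubtype.SYNTACTIC_ISOLATE,    # label (spelling assumed)
    "vb": CICSubtype.PREDICATOR,
    "neg": CICSubtype.OPERATOR,
    "q closed": CICSubtype.OPERATOR,        # closed interrogative
    "rstr": CICSubtype.OPERATOR,            # restricter
    "sbj": CICSubtype.GF_OR_SR,
    "dir obj": CICSubtype.GF_OR_SR,
    "ind obj": CICSubtype.GF_OR_SR,
    "loc": CICSubtype.GF_OR_SR,
    "aim": CICSubtype.GF_OR_SR,
}


def subtype_of(label: str) -> CICSubtype:
    """Return the single subtype of a CIC label; unknown labels raise KeyError."""
    return LABEL_TO_SUBTYPE[label]
```

Exhaustiveness, by contrast, is a property of the full label inventory and cannot be read off a partial table such as this one.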


9.2 Excursus: The Mixed and Full Approaches to Representation 


9.2.1 The Mixed Representation 

It is important to understand how grammatical function and semantic role are displayed in our 
system. Our simple labeling of phrase marker nodes with semantic category information (§3.3.1
and §4.3.2) broadens our phrase markers beyond those standardly used to depict syntactic
relationships (§1.3.2). To enhance further our representation of clauses, we include information concerning
the “three rudimentary categories all of which play a particularly crucial role in the understanding
of human language: (a) parts of speech, (b) semantic roles, and (c) grammatical relations.”¹ We use
these informal definitions of each category type, taken from our glossary: 


* Part of speech. The grammatical class or grammatical category of a segment. The 
specifics of our part-of-speech categories are given in chap. 3. 

* Semantic role. The semantic relation exhibited by a participant in a clause. For example,
in “Gorgou smote the wee gozingpol,” “Gorgou” has the semantic role of doer and “the
wee gozingpol” has the semantic role of patient. Also known as the logical role, thematic
role, theta role, thematic relation, case role, and deep case relation.² As we will see in
detail in chap. 10, many of the CIC labels in our phrase markers specify semantic roles.

* Grammatical relation. The syntactic function of a clause constituent in relation to other 
constituents in its clause. Examples: subject, predicator, direct object. “Any one of several 


1. Lindsay J. Whaley, Introduction to Typology: The Unity and Diversity of Language (Thousand Oaks, CA: Sage, 
1997) 54. The terms grammatical relation and grammatical function are used preferentially in “certain frameworks" (so 
R. L. Trask, A Dictionary of Grammatical Terms (London: Routledge, 1993] 122). We use grammatical function (GF) 
in our work. 

2. Whaley, Introduction, 75. 
specific grammatical relations which a noun phrase can bear within its [clause]. The most
widely recognized grammatical relations are subject, direct object, indirect object and
oblique object.”³


An example may help. In “and Jacob broke camp Sukkot-ward” (Gen 33:17), “Jacob” has the part 
of speech proper noun, semantic role agent, and grammatical function subject. One might configure 
one’s representation so that the node label information would include all three. This sort of repre- 
sentation might be termed full. Instead, on an interim basis, we use a mixed representation. The 
rules governing the mixed representation are: 

* The part of speech and semantic category of each segment are indicated.⁴

* Any major grammatical function⁵ exhibited by a CIC is preferentially shown.

* If a CIC has no major grammatical function, then its semantic role is displayed.
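
The second and third rules amount to a simple precedence: show the grammatical function if it is a major one, otherwise show the semantic role. A minimal Python sketch of that precedence follows. It is our illustration only; the function name `mixed_label` is hypothetical, and the set `MAJOR_GFS` is an assumption limited to the three grammatical-function labels named in this section (sbj, dir obj, ind obj).

```python
from typing import Optional

# Assumption: only the GF labels named in this section are treated as major.
MAJOR_GFS = frozenset({"sbj", "dir obj", "ind obj"})


def mixed_label(grammatical_function: Optional[str], semantic_role: str) -> str:
    """Pick the label displayed on a CIC under the mixed representation.

    A major grammatical function takes precedence; otherwise the semantic
    role is displayed.
    """
    if grammatical_function in MAJOR_GFS:
        return grammatical_function
    return semantic_role


# "Jacob" in Gen 33:17: subject and agent, so the subject label is shown.
print(mixed_label("sbj", "agent"))
# A location CIC with no major grammatical function: the role is shown.
print(mixed_label(None, "loc"))
```

Note that the semantic role of a CIC labeled with a major grammatical function is not displayed at all, which is exactly the loss of information that motivates the full representation.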


Phrase marker (9.1) from Lev 17:11 illustrates our mixed representation. Labels sbj, dir obj, and 
ind obj assign grammatical functions (GFs), while labels loc and aim reveal semantic roles (SRs). 


(9.1) [Phrase marker for Lev 17:11; the tree layout was lost in extraction. Surviving CIC labels and glosses, in order: vb: gave / dir obj: his/him / ind obj: to you(m) / loc: upon altar / aim: to atone (with embedded benf: upon souls, you(m)).]


3. Trask, A Dictionary, 123. Avery D. Andrews, “The Major Functions of the Noun Phrase,” in Language Typol- 
ogy and Syntactic Description, vol. 1: Clause Structure (2nd ed.; ed. Timothy Shopen; Cambridge: Cambridge Univer- 
sity Press, 2007) 134: “In the literature, the term ‘grammatical relation’ is used as a virtual synonym of ‘grammatical 
function.’ " 

4. In the Logos embodiment of our data, the part of speech and semantic category information pop up when the 
cursor hovers over a segment. For example, hovering over nara ‘altar’ in Lev 17:11 discloses that this segment is a 
common noun having the semantic category “appliance /furniture.” 

5. John H. Connolly (Constituent Order in Functional Grammar: Synchronic and Diachronic Perspectives [Berlin: 
Foris, 1991] 29) has the major clause elements “subject, predicator, object and complement.” 
Our rules presuppose ways of discerning whether or not a CIC has a grammatical function in a
given clause. This is not necessarily easy, since the identification of grammatical functions,
especially in flexible phrase-order languages such as Biblical Hebrew, often involves more than a
simple syntactic analysis of the clause immediate constituents. As Connolly observes:

We must differentiate between the strictly syntactic and semantic aspects of the terms ‘subject’ and
‘object.’ Generally in linguistics these two terms are employed in a syntactic sense which, as with
syntactic functional terms in general, designates their grammatical relationship to other elements in the
same construction. . . . In [functional grammar], however, . . . they are used in an essentially semantic
sense, to represent different vantage points or perspectives in the presentation of the state of affairs.⁶


9.2.2 The Full Representation 

A price is exacted when semantic role information is squeezed out by grammatical function 
information. The specifics of the semantic role information may be important. For example, a
constituent that functions as an indirect object “can bear a number of semantic roles such as
recipient . . . , experiencer . . . , and source.”⁷ When the loss of information is unacceptable, then a full
representation is preferable.

A full representation for the clause in Lev 17:11 might be as in phrase marker (9.2). We have 
replaced the uninformative licensing relation gram (“grammar determined”) by a semantic role in
the sbj, dir obj, and ind obj CICs. Also, we have declared the grammatical function of the loc and 
aim CICs to be adjunct, moving the semantic role declarations onto the lower lines in the node 
labels. Hence, the upper line of each CIC label gives its grammatical function, and the lower line 
gives its semantic role. 
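
The move from a mixed label to a full two-line label can be sketched as follows. This is a hedged illustration rather than our actual tooling: `FullLabel`, `to_full`, and `MAJOR_GFS` are hypothetical names, and treating every CIC without a major grammatical function as an adjunct simply mirrors what is done for loc and aim in this example.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: only the GF labels named in this section are treated as major.
MAJOR_GFS = frozenset({"sbj", "dir obj", "ind obj"})


@dataclass(frozen=True)
class FullLabel:
    gf: str  # upper line: grammatical function
    sr: str  # lower line: semantic role


def to_full(mixed_label: str, semantic_role: Optional[str] = None) -> FullLabel:
    """Convert a mixed-representation label into a full two-line label."""
    if mixed_label in MAJOR_GFS:
        # The mixed label hid the semantic role, so it must be supplied.
        if semantic_role is None:
            raise ValueError(f"semantic role needed for {mixed_label!r}")
        return FullLabel(gf=mixed_label, sr=semantic_role)
    # The mixed label was itself a semantic role; the GF becomes adjunct.
    return FullLabel(gf="adjunct", sr=mixed_label)


print(to_full("loc"))                 # adjunct over loc
print(to_full("dir obj", "patient"))  # dir obj over patient
```

The `ValueError` branch makes the cost of the mixed representation explicit: for CICs with a major grammatical function, the semantic role cannot be recovered from the mixed label alone.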


(9.2) [Phrase marker giving a full representation of Lev 17:11; the tree layout was lost in extraction. Legible two-line CIC labels (grammatical function over semantic role): vb over pred; dir obj over patient; adjunct over loc; adjunct over aim; adjunct over benf. Surviving glosses, in order: I gave / his/him / you(m) / the / altar / to / atone / upon.]


6. Ibid., 47. 
7. Whaley, Introduction to Typology, 70. 
9.3 Five Types of Clause Immediate Constituent


In the next five subsections, we discuss most of our CIC subtypes. We devote chap. 10 to the 
second aspect of the fifth subtype, semantic roles. 


9.3.1 Impermanent Clause Immediate Constituents

The impermanent CICs are incomplete or recalcitrant items and are either: (1) underspecified 
"and" cue phrases (incomplete) or (2) indeterminate constituents (recalcitrant). Eventually, the for- 
mer will move into discourse level, while the latter will remain indeterminate until we emend them. 


9.3.1.1 Underspecified “And” Cue Phrases 

Many phrase markers in this book begin with a coordinating conjunction CIC labeled dl seq
w/gram (discourse-level sequential w; see phrase marker [8.10]) or dl and/1st in cl (first-in-clause
discourse-level “and”; see phrase marker [8.13]). In our present database, the former occurs 17,429
times, and the latter occurs 9,138 times. Ostensive rules for identifying all instances of “and” are
given in §3.1.1. The underspecified “and” cue phrases are an easily delimited subset of these.

Although diagrammed as CICs at the beginning of clauses, the underspecified “ands” are
supraclausal cue phrases (§21.3.2.1), the polysemy of which we have not yet resolved by discourse
analysis. We consider them to be impermanent interlopers at clause beginnings. They are not true CICs.

9.3.1.2 Indeterminate Constituents

There are two small sets of indeterminate CICs: the 24 lapsus calami CICs and the 33 nebulous
CICs. These CICs should eventually be reclassified as a result of suitable emendation, exegetical
insight, or fiat. We listed our set of lapsus calami in §3.2.1.1. The CICs labeled nebulous (neb/gram)
are a hodgepodge.⁸ Consider, as an example, phrase marker (9.3) from 1 Sam 2:32:


(9.3) [Phrase marker for 1 Sam 2:32; the tree layout was lost in extraction. Surviving glosses, in order: and / thou(m) wilt gaze / distress of / refuge / in / all of / which / he will do good / [nota acc.] / Israel.]


The clause, with its problematic צַר מָעוֹן, has received disparate translations, including:


8. They are in 1 Sam 2:32; 2 Sam 7:23 (2x), 14:26, 16:11, 17:3, 19:7; 1 Kgs 8:31, 11:15, 11:25; 2 Kgs 9:25; Isa 
8:6, 41:4, 43:14, 47:9; Jer 3:10, 8:15 (2x); Ezek 14:4, 27:19, 45:5; Mic 6:12; Hab 2:5; Ps 39:7, 59:13, 68:14, 104:15, 
141:5, 148:13; Prov 14:25; Qoh 5:6; Dan 11:18; and 1 Chr 26:16. Being so rare, they have no significant effect overall. 
KJV And thou shalt see an enemy in my habitation, in all the wealth which God shall
give Israel.

RSV Then in distress you will look with envious eye on all the prosperity which shall be
bestowed upon Israel.

ASV And thou shalt behold the affliction of my habitation, in all the wealth which God
shall give Israel.


9.3.2 Syntactically Isolated Clause Immediate Constituents 

Zwicky recognizes a curious kind of noun phrase (NP):⁹ “Isolated NPs, standing alone (Hey,
idiot!) or interrupting sentences (I’m afraid, you idiot, that your hair is on fire).” He sees “two
kinds of uses for isolated NPs: vocative/exclamative, telegraphic,” distinguishing two types of
vocatives (address and call) and three types of exclamatives (epithet, dismay, and astonishment).
He displays six telegraphic uses, which he sees as “shading off from fragment NPs”: offer, request,
hot news, identification of type, identification of individual, and discourse topic. McCawley lucidly
lays the situation out under the heading “Extrasentential Discourse Units.”


There are a variety of linguistic units that are not obviously constituents of sentences, though they are 
in many cases loosely linked to particular sentences. .. . The absence, to my knowledge, of any respect 
in which vocatives behave like syntactic constituents of linguistic units that they appear in suggests 
that . . . either they are combined with the host S at the top of the syntactic structure . . . or they simply 
do not make up syntactic units with the host S. . . . Other types of linguistic units that often have host 
Ss but are not obviously constituents of the host Ss include many of the quite diverse items that are 
often indiscriminately lumped under the term “interjection.”¹⁰


We recognize three kinds of syntactically isolated CICs or isolates: vocatives, exclamatives, and 
labels (this last being akin to Zwicky's identification of telegraphic uses). 
9.3.2.1 Vocative CICs 

As our glossary states, a vocative is “a constituent in direct speech intended specifically to at- 
tract the attention of the one addressed." Here, we display vocatives in three kinds of context. 

A vocative can be the only CIC in an utterance, a stand-alone, truly isolated NP. For example, 
consider phrase marker (9.4) from Exod 3:4: 


(9.4) [Phrase marker for Exod 3:4; the tree layout was lost in extraction. Surviving glosses, in order: he said / Moses / Moses. The embedded object-of-address clause consists of the echoed vocative alone.]


Alternatively, a vocative can interrupt a clause, which is what happens in phrase marker (9.5) from 
Deut 33:7: 


9. Arnold M. Zwicky, “Isolated NPs,” http://www-csli.stanford.edu/~zwicky/isolated.hnd.pdf. See also Kerstin 
Fischer, ed., Approaches to Discourse Particles (Bingley, UK: JAI, 2008) 8—9, where “integratedness” is discussed. 
10. James D. McCawley, The Syntactic Phenomena of English (2nd ed.; Chicago: University of Chicago Press, 
1998) 750—53. 
(9.5) [Phrase marker for Deut 33:7; the tree layout was lost in extraction. Surviving glosses, in order: thou(m) hear / Yahweh / voice of / Judah.]


More typically, the vocative is clause initial or final. Once, in Jer 23:1, a vocative combines with 
two other isolates to produce an excl voctv labl clause. Phrase marker (9.6) shows this clause: 


(9.6) [Phrase marker for Jer 23:1; the tree layout was lost in extraction. Surviving glosses, in order: Hoy! / shepherds / ruining / and / scattering / [nota acc.] / flock of / pasturage / Yahweh.]

9.3.2.2 Exclamative CICs 
According to our glossary, an exclamative is “a constituent that expresses a sudden or strong
emotion.” The constituent may be a single segment drawn from the list in §3.2.1.1 or may involve
the echoing of one of these segments. As an example, consider phrase marker (9.7) from Isa 10:5:


(9.7) [Phrase marker for Isa 10:5; the tree layout was lost in extraction. Surviving glosses, in order: Hoy! / Asshur / staff of / anger / me.]

The constituent may be a complex noun phrase, as in phrase marker (9.8) from 2 Kgs 4:19: 
(9.8) [Phrase marker for 2 Kgs 4:19; the tree layout was lost in extraction. Surviving glosses, in order: head / me / head / me.]


About one-third of the exclamatives occur in single-CIC clauses as stand-alone isolates. 


9.3.2.3 Label CICs 


There are 199 instances of stand-alone isolated labels. This leaves 482 instances where a label
CIC does not stand alone. The label is the first CIC in these clauses 13% of the time and is last 73%
of the time. In phrase marker (9.9) for Jer 5:18, the label is medial:


(9.9) [Phrase marker for Jer 5:18; the tree layout was lost in extraction. Surviving glosses, in order: and / also / in / the / days / the / they(m) / oracle of / Yahweh / not / I shall make / with / you(m) / destruction.]


9.3.3 Predicator CICs

Our glossary definition of a predicator is “a verbal or quasiverbal constituent that specifies
equivalence, activity, state, or process.” The predicators are the focus of chaps. 11-15 and 18.
9.3.4 Operator CICs


We next take up what we term operator CICs. An operator is a “grammatical element which 
bears a scoping relation to some part of its [text]. Examples include determiners and quantifiers,
negation, tense, aspect and mood.”¹¹ The scope of an operator refers “to that stretch of language
affected by the meaning of a particular form. . . . As a general illustration, in English the scope of
negation typically extends from the negative word until the end of the clause.”¹² For Biblical
Hebrew, certain operators have as their scope a phrase, an entire clause, or even multiple clauses. Each
involves a single part of speech. Our six operator CICs, along with their definitions, are: 


* negative. A segment that negates a constituent or constituents.

* closed interrogative. A segment that asks a yes-no question about a constituent.

* 03 /D3 ‘also’. When functioning within a clause, this distributionally complex segment
expresses inclusion of some number of following constituents. We keep this lexeme
separate from the includer group for reasons given in §3.2.1.3.

* includer (48 ‘also’). When functioning within a clause, this segment expresses inclusion
of some number of following constituents.

* restricter (pn ‘only’ and many instances of 7&). When functioning within a clause, these
segments express restriction of some number of following constituents.

* modal. A segment or idiom that expresses the speaker’s (un)certainty or desire with
regard to some statement.


The operator CICs exhibit disparate characteristics. While negative, (closed) interrogative, and to a
lesser extent 03 ‘also’, includer, and restricter CICs exhibit classical variable scoping, the scoping
of modal CICs is often difficult to establish with confidence (see §9.3.4.4).

9.3.4.1 Negative CICs 

The negative part of speech was defined ostensively in §3.2.2.3. For the purposes of this
subsection, we present a trio of phrase markers illustrating the progression of scoping of a negative
operator from phrase to clause to multiple clauses.

Phrasal Negative Operators. A negative can operate upon a noun phrase to produce a
constituent consisting of a negative operator and a noun phrase licensed by inverted modification (inv
mod). The scope of negation is not necessarily to the end of the clause. Phrase marker (9.10) from
Jer 5:7 contains the negated noun phrase לֹא אֱלֹהִים ‘not gods’:


(9.10) [Phrase marker for Jer 5:7; the tree layout was lost in extraction. Surviving glosses: they(m) swore / gods.]


11. Trask, A Dictionary, 195. To align with Biblical Hebrew practice, we have replaced Trask’s “sentence” with 
“text” in the definition. 
12. David Crystal, A Dictionary of Linguistics and Phonetics (5th ed.; Oxford: Blackwell, 2003) 407. 
Clausal Negative Operators. There are 5,316 instances in which a negative operator
immediately precedes a predicator, such as in (9.11) from 2 Chr 20:33:


(9.11) [Phrase marker for 2 Chr 20:33; the tree layout was lost in extraction. Surviving glosses, in order: still / the / people / not / them(m) / gods of / fathers / them(m). The negative immediately precedes the verb.]


Given this very common adjacency behavior, one might be tempted to treat the negative as operat- 
ing at phrase level to form a negative verbal constituent licensed by inverted modification. For two 
reasons, we prefer to treat such negatives as operating upon their entire clauses: 


1. There are 279 instances in which the negative operator is not adjacent to the predicator. 
Any representation of negative operators must handle this sort of separation. Phrase marker 
(9.12) from Ps 6:2 illustrates the behavior to which we refer: 


(9.12) [Phrase marker for Ps 6:2; the tree layout was lost in extraction. Surviving glosses, in order: Yahweh / don't / in / anger / thee(m) / do thou(m) reprove. An instrument CIC separates the negative from the verb.]

Combining the negative and the verb to form a constituent would involve “tangling,” producing
a non-tree phrase marker (§4.2). We avoid this complication where possible.


2. Polarity is a characteristic of the clause in discourse analysis, where one encounters
assertions such as: “Clause₁ and Clause₂ differ in polarity (i.e., one clause is positive and
the other negative).”¹³ Given our interest in discourse analysis, we assign polarity to the
clause.


Multiple-Clause Negative Operators. The scope of a negative operator can extend over mul- 
tiple clauses. Consider phrase marker (9.13) for a pair of juxtaposed clauses from Isa 38:18: 


(9.13) [Phrase marker for Isa 38:18; the layout was lost in extraction. Surviving glosses, in order: not / Sheol / she will praise / thee(m) / death / he will praise / thee(m). The single negative CIC is attached to both juxtaposed clauses.]


In this and similar instances, we have used the mechanism of multidominance to indicate the
extended scope of the negative operator. As a result, the phrase marker is not a tree (this matter is
discussed in §20.2).
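
The difference between a tree and a multidominant phrase marker is easy to state computationally: in a tree, no node has more than one mother. The Python sketch below is only an illustration under our own assumptions. The edge-list encoding, the function names, and the node names (`sent`, `cl1`, `cl2`, and so on) are hypothetical and merely mimic the shape of a two-clause sentence with a shared negative.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple

Edge = Tuple[str, str]  # (mother, daughter)


def mothers(edges: Iterable[Edge]) -> Dict[str, Set[str]]:
    """Map each daughter node to the set of its mothers."""
    result: Dict[str, Set[str]] = defaultdict(set)
    for mother, daughter in edges:
        result[daughter].add(mother)
    return dict(result)


def multidominated(edges: Iterable[Edge]) -> Set[str]:
    """Return the nodes that have more than one mother."""
    return {node for node, ms in mothers(edges).items() if len(ms) > 1}


# Hypothetical encoding: two juxtaposed clauses sharing one negative CIC.
shared_negative = [
    ("sent", "cl1"), ("sent", "cl2"),
    ("cl1", "neg"), ("cl2", "neg"),   # the negative has two mothers
    ("cl1", "sbj1"), ("cl1", "vb1"),
    ("cl2", "sbj2"), ("cl2", "vb2"),
]

print(multidominated(shared_negative))  # {'neg'}
```

This check covers only the single-mother condition. It does not test connectedness or the absence of cycles, both of which a full tree test would also require.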

As an additional example, consider this interesting phrase marker (9.14) from Isa 23:4 (see 
p. 124). 


9.3.4.2 Closed Interrogative CICs 

The interrogative part of speech was defined ostensively in §3.2.2.4. Closed interrogatives are
operators in that they, like negatives, can exhibit phrasal, clausal, or multi-clausal scoping.¹⁴

In Biblical Hebrew, the interrogative operators for closed questions are -7 ‘yes-no?’ and
DN ‘[question]’. Only if we were to introduce a rhetorical question operator, "Non ‘is not . . . ?’
(rhet/gram),¹⁵ would the interrogative operator be a phrase level constituent of the rhetorical
operator. Phrase markers (9.15a) and (9.15b) from Job 10:20 show the two approaches.


13. Simon Corston-Oliver, “Beyond String Matching and Cue Phrases: Improving Efficiency and Coverage in
Discourse Analysis,” Microsoft Research Tech Report, November 1998, 4.
http://www.research.microsoft.com/apps/pubs/default.aspx?id=69677.

14. Andrew Radford, Syntax: A Minimalist Introduction (Cambridge: Cambridge University Press, 1997) 130—47. 

15. We have not yet decided whether to introduce a rhetorical operator CIC. 
(9.14) [Phrase marker for Isa 23:4; the layout was lost in extraction. Surviving glosses, in order: not / I was in labor / and / not / bore / and / not / made big / young men / raised / virgins.]

(9.15a) and (9.15b) [Paired phrase markers for Job 10:20; the layout was lost in extraction. In (9.15a) the closed interrogative (yes/no) and the negative (not) are separate CICs; in (9.15b) they combine under a rhetorical operator (rhet). Surviving glosses: yes/no / not / little / days / me.]

Most translations render the clause as a rhetorical question: “Are not the days of my life few?”
(NRSV). The NJPS translates it as having indicative modality: “My days are few.”

Usually, a closed interrogative scopes a single clause, as in (9.16) from Ruth 1:19:


(9.16) [Phrase marker for Ruth 1:19; the tree layout was lost in extraction. Surviving glosses, in order: and / they(f) said / yes/no / this / Noomi.]

A closed interrogative can also scope multiple clauses. For example, in phrase marker (9.17) from
Jer 49:7, the interrogative operator scopes the three juxtaposed clauses making up the sentence.

(9.17) [Phrase marker for Jer 49:7; the layout was lost in extraction. Surviving glosses, in order: yes/no / does not exist / still / wisdom / Teman / she perished / counsel / from / sons / she corrupted / wisdom / them(m).]

9.3.4.3 Quantifier CICs 

A group of operator CICs, known as quantifiers or focus particles, specifies the domain of 
inclusion (“includers”) or restriction (“restricters”) of the constituents upon which they operate. 
Because “their meaning always indicates that the referent to which they refer is an addition to or 
limitation of another referent,”¹⁶ they are termed focus particles by some. We prefer to call them
quantifiers. 

Includer CICs 03/03 ‘Also’ and 9X ‘Also’. These were touched on in §3.2.1.3. We find phrasal
instances, as in phrase marker (9.18) from Gen 47:3:


16. C. H. J. van der Merwe, J. A. Naudé, and J. H. Kroeze, A Biblical Hebrew Reference Grammar (Sheffield:
Sheffield Academic Press, 1999) 311.
(9.18) [Phrase marker for Gen 47:3; the tree layout was lost in extraction. Surviving glosses, in order: shepherd of / flock / servants / thee(m) / also / we / fathers / us.]


We find גם having single-clause scope, as in phrase marker (9.19) from Exod 2:19:

(9.19) [Phrase marker for Exod 2:19. Glosses: and / also / to draw water / he drew water / to / us.]

And we find גם with multiple-clause scope, such as in phrase marker (9.20) from 1 Sam 15:29.
In this extensive multi-clause phrase marker, גם is a supra-clausal cue phrase (this notion is
discussed in chap. 21). As noted in §3.2.1.3, the uses of גם are many and complex. Consequently,
in practice, we have left גם as the sole member of its own class (see (9.20), p. 127).


Restricter CICs אך ‘Only’ and רק ‘Only’. We have divided אך into two classes, epistemic
modal adverb (‘surely’, 62x, §3.2.2.1) and restrictive adverb (99x, §3.2.2.2). Each CIC רק (25x)
is classified as a restrictive adverb (§3.2.2.2). With one exception (Deut 28:13), the restrictives are
clause initial.¹⁷ Phrase marker (9.21) from Num 12:2 uniquely includes both kinds of restricter.

17. However, in Josh 6:18, רק follows “and.”

(9.20) [Phrase marker for 1 Sam 15:29. Glosses: also / victory of / Israel / not / he will deceive / and / not / he will regret / because / not / man / he / to / regret.]

(9.21) [Phrase marker for Num 12:2. Glosses: yes/no / only / only / in / Moses / he spoke / Yahweh.]


9.3.4.4 Modal CICs 

A modal is a segment that expresses the speaker's uncertainty or desire with regard to a particular
statement. As defined ostensively in §3.2.2.1, our modals consist of seven lexemes. We know of
no modal that operates at phrase level. Single-clause scoping is typical. Multiple-clause scoping is
less common. As an example, consider phrase marker (9.22) from 1 Kgs 18:5, which contains an
instance of the modal אולי ‘perhaps’. With modals, we encounter a kind of multiple-clause scoping
that differs from what we have seen with the previous operator CICs. With the other operators,
the extent of scoping is fairly clear-cut in any given case. But with the modals, operator scope is
less obvious. What is the scope of the modal in (9.22)? Since the clauses form a discourse unit, the
scope of the modal is all three clauses. Hence, in a proper discourse representation, the modal will
be supra-clausal.

(9.22) [Phrase marker for 1 Kgs 18:5. Glosses: perhaps / we will find / grass / and / we will keep alive / horse / and / mule / and / not / we will cut off / from / beast.]
Since the modals appear only in direct speech, we always know the upper bound on the extent of
scoping: the end of the speech. But the actual extent of the scope typically needs working out.

Since multiple-clause scoping is really an issue at discourse level, the working out of the best
way of representing scoping can wait until we get into formal discourse analysis. For now, it is
enough to alert readers to scoping issues so they can be attentive as they work with the texts.
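
The scoping problem just described can be pictured with a small sketch. It is only an illustration under stated assumptions: the class `OperatorScope` and the functions `widest_scope` and `narrow_scope` are hypothetical names, not part of any database described here, and clauses are simply numbered from the clause that contains the operator.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OperatorScope:
    """Hypothetical record of the clauses that an operator CIC scopes over."""
    operator: str      # e.g. "modal" or "q closed"
    first_clause: int  # index of the clause that contains the operator
    last_clause: int   # index of the last clause in scope (inclusive)

    def clause_count(self) -> int:
        return self.last_clause - self.first_clause + 1


def widest_scope(operator: str, operator_clause: int, speech_end: int) -> OperatorScope:
    """Upper bound on scope: from the operator's clause to the end of the speech."""
    if speech_end < operator_clause:
        raise ValueError("the speech cannot end before the operator's clause")
    return OperatorScope(operator, operator_clause, speech_end)


def narrow_scope(scope: OperatorScope, last_clause: int) -> OperatorScope:
    """Record an analyst's decision about where the scope actually ends."""
    if not scope.first_clause <= last_clause <= scope.last_clause:
        raise ValueError("the chosen end must lie inside the upper bound")
    return OperatorScope(scope.operator, scope.first_clause, last_clause)


# Three clauses scoped by one modal, as in the analysis of (9.22).
bound = widest_scope("modal", operator_clause=0, speech_end=2)
print(bound.clause_count())
```

The upper bound is mechanical, while the narrowing step stands for the judgment that the text says "typically needs working out."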
9.3.5 The Grammatical Function and Semantic Role CICs 


In this subsection we discuss the grammatical function aspect of clause immediate constituents, 
deferring the discussion of the semantic roles until chap. 10. 
9.3.5.1 Our Repertoire of Grammatical Function CICs 


Whaley portrays linguistic opinion regarding the primary grammatical functions as follows:

Both in traditional grammar and in several contemporary syntactic theories . . ., it is assumed that
there are three primary grammatical relations that can be held by noun phrases in a clause: subject,
direct object, and indirect object. . . . The[se] entities specified by the verb are called arguments.¹⁸

As long as we use a mixed representation (§9.2.1), we subdivide two of the primary grammatical
functions and thereby obtain these seven interim grammatical function CICs:¹⁹

Master Category     Subsidiary Categories
subject             subject (sbj)
                    subject-of-address (sbj addr)
                    Janus sbj-cmp/sbj (Janus sc/sb)
direct object       direct object (dir obj)
                    object-of-address (obj addr)
                    second object (2nd obj)
indirect object     indirect object (ind obj)

18. Whaley, Introduction to Typology, 68–69. We use “grammatical relation” and “grammatical function”
interchangeably (see Trask, A Dictionary, 122).

19. The subject complement and object complement are not complements of the predicator but rather of the subject
and object, respectively, and so are not included in our table. See Crystal, A Dictionary, 88.


We have already provided examples of several of the CICs in this group: 


1. For subject, see phrase marker (9.1) in §9.2.1.
2. The subject-of-address CIC has a speech as the subject of a Niphal passive of an utterance
verb. As an example, consider phrase marker (9.23) from Gen 22:14:

(9.23) [Phrase marker for Gen 22:14. Glosses: it will be said / the / in / mountain of / Yahweh / he will appear.]


3. Phrase marker (9.24) shows a Janus sc/sb in 1 Chr 6:2. On this CIC type, see Appendix 6.

(9.24) [Phrase marker for 1 Chr 6:2. Glosses: and / these / names of / sons of / Gershom / Libni / and / Shimi.]


4. For examples of direct objects, see phrase markers (1.5), (4.1), (4.2), (5.1), and so on.
5. For an instance of an object-of-address, see phrase marker (9.4).
6. Second objects go with ditransitive verbs as in phrase marker (9.25) from Gen 41:39:

(9.25) [Phrase marker for Gen 41:39. Glosses: inform / God / [nota acc.] / thee(m) / [nota acc.] / all of / this.]


7. For indirect objects, see phrase markers (8.10), (8.14), (8.16), (9.1), among others. 


9.3.5.2 Special Characteristics of Some Grammatical Function CICs 

In addition to supplying CIC labels and the vague gram licensing relation, we flag four
CIC phenomena: suspension (881x), resumption (733x), distribution (698x), and open interrogation
(1,460x). Interestingly, with one maddening exception, these phenomena never occur
simultaneously.²⁰

Suspension and Resumption. Sometimes a grammatical function is stated (“suspended”) and then
repeated (“resumed”), usually by a pronoun. Phrase marker (9.26) from Gen 3:12 shows a suspended
and resumed subject. CICs other than grammatical functions also exhibit suspension and resumption.
Phrase marker (9.27) from Jer 50:21 shows a pair of location CICs exhibiting suspension/resumption.

(9.26) [Phrase marker for Gen 3:12. Glosses: the / woman / which / with / me / thou(m) didst give / she / she gave / to / me / from / the / tree.]

20. In Exod 32:33, we encounter a suspended direct object that contains an open interrogative.

(9.27) [Phrase marker for Jer 50:21. Glosses: upon / the / land / Merathayim / thou(m) ascend / upon / her.]
Distribution. As our glossary has it, a distributive subject is “[a] singular nominal subject that
refers separately to each and every member of a plural subject. Example: ‘They . . . each man.’ ”
And similarly for other kinds of CICs. Phrase marker (9.28) from Exod 33:4 includes a distributive
subject, a distributive object, and a distributive location CIC:

(9.28) [Phrase marker for Exod 33:4. Glosses: and / not / they put / each / adornment / his/him / upon / his/him.]

Open Interrogation. Two interrogative parts of speech were defined ostensively in §3.2.2.4.
Questions are of two sorts, closed and open.²¹ The former “ask questions with a closed set of
answers,”²² while the latter involve “questions where the set of answers is open-ended.”²³ Closed
interrogatives were taken up in §9.3.4.2. Here, we focus on open interrogatives. Regarding open
interrogatives, Huddleston comments:

The grammatically distinctive property of open interrogatives is the presence of an interrogative word:
who, what, which, why, etc. . . . The interrogative words . . . combine two roles: one as a marker of
open interrogative clause type, one as a pronoun, determiner, temporal adverb, etc., with functions
that can also be filled by noninterrogative words.²⁴


21. R. D. Huddleston, “Sentence Types and Clause Subordination,” in Concise Encyclopedia of Grammatical
Categories (ed. K. Brown and J. Miller; Amsterdam: Elsevier, 1999) 334–37.
22. Ibid., 334.
23. Ibid., 335.
24. Ibid.

What Huddleston is getting at can readily be seen by considering a concrete example. Consider
phrase marker (9.29) from Prov 6:9:

(9.29) [Phrase marker for Prov 6:9. Glosses: until / when / lazy / thou(m) wilt lie down.]


As its first role, the interrogative מתי ‘when?’ indicates that the clause involves an open question.
As its second role, the CIC is part of a time interval (tm int) adjunct. Our representation shows that
the CIC is a time adjunct, a tm int, and also indicates that the clause is interrogative (open intrg).

Many open-interrogative segments are not immediately dominated by a CIC-node but, rather,
are parts of phrases. These constructions are usually set phrases well on their way to being
lexicalized. Example: למה ‘to what?’ = ‘why?’ Three hundred eighty-two open interrogatives are parts
of basic phrases such as this. In addition, there are 19 atypical phrasal interrogatives.²⁵ The phrasal
interrogative is never deeply embedded,²⁶ and so we may represent the situation as before. Phrase
marker (9.30) from Job 38:29 illustrates our representation in these sorts of cases:

(9.30) [Phrase marker for Job 38:29. Glosses: from / belly of / who / he went out / the / ice.]


9.4 On Recognizing CIC-Types 


Having devised and displayed an exhaustive top-level taxonomy of CIC-types and having specified
the kinds of CICs that belong to each subtype,²⁷ producing at least one example of each, we
now need only to comment on how each CIC is recognized and assigned to its proper subtype. In
other words, what principles and rules does the gram licensing relation subsume?


25. An open interrogative is a nomen rectum in Gen 24:23; 24:47; 1 Sam 12:3 (3x); 17:55, 56, 58; Jer 8:9; 44:28;
Job 26:4, 7; 38:29. Coordinated interrogatives are found in Exod 10:8; 1 Kgs 2:36, 2:42; 2 Kgs 5:25. A discontinuous
echo is found in 2 Kgs 9:32. The construction in Job 26:7 (בלימה) is unique.

26. The open interrogative CIC dominates a basic phrase (§5.2), except in Job 26:7 and 38:29 (each of which
involves a prepositional phrase).

27. Recall that any CIC belongs to one of the following five exhaustive and mutually exclusive subtypes:
impermanent, syntactic isolate, predicator, operator, [grammatical function and semantic role].

The CIC category of a few kinds of CICs is readily evident. For example, the exclamatives and
modals involve little homography and thus are for the most part easily classified.²⁸ But the
recognition of most CIC categories presupposes that the clause of which they are a part has been fully
parsed. In these cases, the gram licensing relation subsumes a full grammar of Biblical Hebrew and
then some.

Consider, for example, recognition of grammatical function. While Whaley asserts in effect that
the grammatical function subtype is “identifiable by special morphological and syntactic marking,”²⁹
this is not true for Biblical Hebrew where, for instance, direct objects, especially in poetry, are
often unmarked. Another approach examines noun-phrase position relative to the predicator. For
example, Gildea and Jurafsky “expect . . . [position] to be highly correlated with grammatical
function, since subjects will generally appear before a verb, and objects after.”³⁰ This assumes fixed
constituent ordering, and so it is not very useful for Biblical Hebrew. In our computer parsing, we
found that comparing predicator semantics and gender with noun-phrase semantics and gender was
quite useful in distinguishing subjects from direct objects.³¹ In fact, in determining the grammatical
functions of CICs, we have often had to draw intuitively on non-local information and on world
knowledge.³² The phrase markers that we analyze incorporate varying amounts of extrasyntactic
information. Homographs have been resolved wherever we have become aware of their existence.
This means, for example, that the four forms of the object marker את have been
classified differently from the identical forms functioning as ‘with’. As a consequence, more than
800 CICs have been predetermined to have an accompanier semantic role rather than having the
grammatical function of direct object. We take advantage of all such assistance in this investigation
of grammatical functions and semantic roles.
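
The gender comparison mentioned above can be illustrated with a minimal sketch. It is not the parser referred to in the text: the names `Nominal` and `subject_candidates` are hypothetical, only gender is compared (the semantic comparison is left out), and the toy clause is loosely modeled on Gen 3:12.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Nominal:
    """Hypothetical noun-phrase record with just enough detail for the heuristic."""
    gloss: str
    gender: str  # "m" or "f"


def subject_candidates(verb_gender: str, nominals: list[Nominal]) -> list[Nominal]:
    """Keep only the nominals whose gender agrees with the finite verb."""
    return [n for n in nominals if n.gender == verb_gender]


# The verb "she gave" is feminine, so a masculine nominal cannot be its subject.
nominals = [Nominal("woman", "f"), Nominal("tree", "m")]
print([n.gloss for n in subject_candidates("f", nominals)])
```

When every nominal in a clause agrees with the verb, the filter leaves the ambiguity in place, which is where the non-local information and world knowledge mentioned above would have to come in.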


9.5 Brief Summary 


Excursus: Mixed versus Full Representation. The phrase markers in this book (and, at present, 
in our database) supply a mixed representation of the syntactic / semantic phenomena exhibited by 
Biblical Hebrew. They privilege display of GF information over display of SR information. In our 
planned full representation, both GF and SR information will always be displayed. 

The Top Levels of the CIC-Subtype Taxonomy. Clause immediate constituents are divided into 
these five subtypes: 


1. Impermanents. CICs that will, in due course, be assigned to other CIC classes. 

2. Syntactic isolates. NPs that are syntactically isolated—vocatives, exclamatives, and 
labels. 

3. Predicators. The verbal or quasiverbal nuclei of their clauses.


28. Homography can be daunting. Consider the exclamative "Ax ‘ʻO! (found in four different lexemes!, §3.2.1.1) or 
the modal ^3 ‘surely’ (found in seven different lexemes!, $3.2.2.1). 

29. Whaley, Introduction to Typology, 67. 

30. Daniel Gildea and Daniel Jurafsky, “Automatic Labeling of Semantic Roles,” Computational Linguistics 28
(2002) 257.

31. F. I. Andersen and A. D. Forbes, “Opportune Parsing,” in Bible and Computer (Paris: Honoré Champion, 1995)
61.

32. For example, in determining the CIC label of a pronominal constituent, one must know its possibly-remote
anaphor.

4. Operators. CICs that exert their influence over phrases, clauses, or multiple clauses:
negatives, closed interrogatives, quantifiers (also known as “focus particles”), and modals.

5. Grammatical functions / semantic roles. We recognize three basic GFs: subject, direct
object, and indirect object. SRs are considered in chap. 10.

Special Features of Some GFs/SRs. A CIC may exhibit the following special features: 


1. It may be suspended and later resumed, usually by a free pronoun. 
2. It may exhibit distribution. 
3. It may perform open interrogation. 


These phenomena are mutually exclusive, with one curious exception. 
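
The summary above can be restated as a small data model. This is a sketch under stated assumptions, not the layout of the authors' database: `CICSubtype`, `SpecialFeature`, and `violates_exclusivity` are hypothetical names, and the four flags follow the four phenomena counted in §9.3.5.2.

```python
from enum import Enum, Flag, auto


class CICSubtype(Enum):
    """The five top-level CIC subtypes listed in the summary."""
    IMPERMANENT = auto()
    SYNTACTIC_ISOLATE = auto()
    PREDICATOR = auto()
    OPERATOR = auto()
    GF_SR = auto()  # grammatical function / semantic role


class SpecialFeature(Flag):
    """Special features that a CIC may carry; normally at most one is set."""
    SUSPENSION = auto()
    RESUMPTION = auto()
    DISTRIBUTION = auto()
    OPEN_INTERROGATION = auto()


def feature_count(features: SpecialFeature) -> int:
    """Number of special features set on one CIC."""
    return bin(features.value).count("1")


def violates_exclusivity(features: SpecialFeature) -> bool:
    """True when more than one special feature is set on the same CIC."""
    return feature_count(features) > 1


# The one reported exception: a suspended constituent containing an open interrogative.
exception = SpecialFeature.SUSPENSION | SpecialFeature.OPEN_INTERROGATION
print(violates_exclusivity(exception))
```

A check of this kind would flag the single exceptional case for inspection rather than reject it.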


Chapter 10 


Semantic Role CICs 


As we noted in chap. 9, nominal CICs can be described by their semantic roles: “the semantic
relationship that a nominal bears to the rest of the clause.”¹ Note that the nominal may be embedded
in a prepositional phrase constituent. Indeed, some theories view prepositions as particles that
simply mark their associated nominals for various roles:² “‘dependent marking’ [signals] the
existence of a grammatical relation between two elements of a sentence . . . by a marker placed on the
dependent term.” In this chapter, we discuss our set of semantic roles, their taxonomy, and their
recognition.

Semantic role analysis is part of many grammatical theories. The degree of refinement of an 
analysis depends on the importance that the roles have in a given theory. According to Van Valin: 


[In Government Binding Theory] nothing depends on which [semantic] role an argument receives,
as long as it receives one (and only one), whereas in [Lexical Functional Grammar] the validity of the
syntactic analysis depends crucially in many instances on the identity of the [semantic] role assigned.³


Not surprisingly, role analysis in Government Binding Theory is much less developed than in
Lexical Functional Grammar.⁴

The various analyses differ with regard to role repertoire, role taxonomy, and role recognition 
criteria. We review typical approaches to these three issues, concluding each review with a sketch 
of our stance. 


10.1 Semantic Role Repertoire 


10.1.1 Representative Approaches 
The number of semantic roles defined in grammars varies vastly, from just a few to many
hundreds.⁵ This is because the extent of the repertoire depends on how the roles are used:


[If the roles are] part of the system of lexical representation, wherein they represent aspects 
of the verb's meaning . .., then a large number will be needed to express the great variety of 


1. Lindsay J. Whaley, Introduction to Typology (Thousand Oaks, CA: Sage, 1997) 290.

2. Avery D. Andrews, “The Major Functions of the Noun Phrase,” in Language Typology and Syntactic
Description, vol. 1: Clause Structure (2nd ed.; ed. T. Shopen; Cambridge: Cambridge University Press, 2007) 143.

3. R. D. Van Valin Jr., “Functional Relations,” in Concise Encyclopedia of Grammatical Categories (ed. K. Brown
and J. Miller; Amsterdam: Elsevier, 1999) 154. We have replaced θ-role with semantic role.

4. Trask complains that “[t]he proponents of [Government Binding Theory] have been remarkably unforthcoming
about precisely which [semantic roles] are posited in the framework” (R. L. Trask, A Dictionary of Grammatical Terms
[London: Routledge, 1993] 278).

5. Daniel Gildea and Daniel Jurafsky, “Automatic Labeling,” Computational Linguistics 28 (2002) 247.

verbal semantic contrasts. . . . [If they] play a role in the statement of grammatical rules,
principles, or constraints . . . , then only as many will be needed as the syntax requires, and this is
a much smaller number than that required for a lexical representation function.⁶


The fullest lexical representation is that of FrameNet II, which tailors over 800 semantic roles
specifically verb-by-verb.⁷ By contrast, a typical abstract representation has “between eight and twenty
unique roles.”⁸ Connolly (following Dik) provides the following list of 22 semantic functions (to
use Connolly's terminology), with an English example of each.⁹

Connolly's Semantic Functions    English Example
Time                             at midnight
Duration                         for an hour
Place                            in Abergynolwyn
Comitative                       in his friend's company
Manner                           slowly
Instrument                       with a hammer
Attendant circumstance           without looking
Agent                            [watched] by a neighbor
Position                         [retained] by the lender
Force                            [demolished] by a storm
Degree                           to a large extent
Comparison                       than you
Cause                            because of his personality
Result                           with predictable consequences
Purpose                          for his own ends
Condition                        if in doubt
Concession                       despite this
Negative / restrictive           seldom
Focusing                         only
Disjunct                         probably
Conjunct                         therefore


6. Van Valin, “Functional Relations,” 151, text interleaved.

7. Josef Ruppenhofer et al., FrameNet II: Extended Theory and Practice, 25 August 2006.
http://framenet.icsi.berkeley.edu. In FrameNet II parlance, semantic roles are frame elements.

8. Whaley, Introduction to Typology, 67.

9. John H. Connolly, Constituent Order in Functional Grammar (Berlin: Foris, 1991) 71–72.

Gildea and Jurafsky suggest these 18 “abstract semantic roles”: agent, cause, degree, experiencer,
force, goal, instrument, location, manner, null, path, patient, percept, proposition, result, source,
state, and topic;¹⁰ Van Valin proffers these 11 “thematic relations”: agent, experiencer, instrument,
force, patient, theme, recipient, goal, source, locative, and path;¹¹ and Whaley puts forward these
13 roles: agent, benefactive, comitative, experiencer, goal, instrumental, locative, patient,
purposive, recipient, source, temporal, and theme.¹² We note that:


1. Some items appear in all four lists. For example: instrument(al) and place (Connolly) =
location (Gildea and Jurafsky) = locative (Van Valin, Whaley).¹³

2. Some items appear in only one list. For example, Gildea and Jurafsky include percept, not
found in other lists.

3. Some distinctions do not withstand scrutiny. For example, Van Valin defines force as
“involuntary causal participant which, unlike an instrument, cannot be manipulated. They
can include things like tornados, storms, and acts of God, as in The flood washed away the
village.”¹⁴ But contrast these clauses:


a. Due to poor levee maintenance, the flood washed away the village. (Instrument?)
b. As Divine retribution, the flood washed away the village.¹⁵ (Force?)


10.1.2 Our Approach to the Role Repertoire 

We have followed the usual approach, namely, “to posit as few roles as possible, identifying a
new role only when it seems to be required.”¹⁶ Over time, we have added roles, reaching our
present tally of 44 semantic roles.¹⁷ The table on p. 138 shows our categories compared with others’
categories. Of our 44 semantic role (SR) categories, 21 are not used by our four reference works.
We have found that they need to be available for real distinctions arising from the combined effects
of the semantics of verbs, prepositions, and nominals in Hebrew composition. The SRs peculiar to
our work fall into several natural groups:


* Two geometrical SRs:
    area
    length
* Path specifier SRs:
    movement bearing and movement origin
    time aim and time origin
* “Balancing SRs”:
    harmed one (opposite of beneficiary)
    undesired outcome (equating result to “desired outcome”)¹⁸


10. Gildea and Jurafsky, “Automatic Labeling,” 279. 

11. Van Valin, “Functional Relations," 152. What Van Valin and others call “theme,” we call “exocentric absolute" 
to avoid confusion with the more usual definition. See §21.1.2.3. 

12. Whaley, Introduction to Typology, 65. 

13. The definitions of agent differ. 

14. Van Valin, “Functional Relations," 152. 

15. By our present conventions ($9.2.1), we will label the flood as the subject in both clauses, hiding its semantic 
role(s). 

16. Whaley, Introduction to Typology, 66. 

17. Three of these SRs will appear later in our full representation: doer, experiencer, and patient (see §9.2.2).

18. This is one area of our system where we may have gone too far. A result may be neither a desirable nor an
undesirable outcome. It may be wise to remove undesirable outcome from our collection of semantic roles or to add
desired outcome as a balancing item.

Andersen-Forbes’ Compared with Others’ Semantic Roles (SRs)

A-F SR                    Others’ SR      Used by(a)
Accompanier               Comitative      C, W
Agential                  Agent           C, W
Aim / purpose             Purpose
Area
Beneficiary               Beneficiary
“But-rather”
Cause                     Cause
Comparison                Comparison      C
Concessive                Concession
Condition                 Condition       C
Cost
Exocentric absolute       Theme           G, V, W
Involved ones
Manner                    Manner          C, G
Material / composition
Movement aim              Goal            G, V, W
Movement bearing
Movement origin
Number count
Number of times
Patient                   Patient         G, V, W
Possessor                 Recipient       V, W
Quantity / quantifier
Reason                    Purposive       W
Result                    Result          C, G
Time interval             Duration        C
Undesired outcome

a. C = Connolly, G = Gildea and Jurafsky, V = Van Valin, W = Whaley.

* Miscellaneous SRs:
    alternate
    “but-rather”
    cost
    deprivation
    involved ones
    material / composition
    number count
    number of times
    quantity
    quoter
    reference
    ruled-over one
    separation

For the record, the four references use a dozen SRs that we have not (yet) had occasion to introduce:

SR Terminology            English Example               Used by
Attendant circumstance    without looking               C
Conjunct                  therefore                     C
Degree                    to a large extent             C, G
Disjunct                  probably                      C
Focusing                  only                          C
Force                     [demolished] by a storm       C, G, V
Negative / restrictive    seldom                        C
Null                      it would be foolish           G
Percept                   It is apparent that . . .     G
Position                  [retained] by the lender      C
Proposition                                             G
State                     he was hollering at them      G

Several of these seem unattested in Biblical Hebrew. For example, we know of no null
constructions (“dummy elements” as in English “It is raining”) in Biblical Hebrew.


10.2 Representative Approaches to Semantic Role Taxonomy 


Here we arrive at the central matter of this chapter, the taxonomy of semantic roles. Underlying
classification is a dilemma common to classification systems: category splitting.¹⁹ When should
one subdivide a category? The literature suggests three ways of handling the “split/quit” dilemma:


1. Map categories onto a line. Thereby, avoid discrete categories. 

2. Manual approach. Split maximally and then group hierarchically. 

3. Algorithmic approach. Use data to cluster the categories and then use a threshold to 
determine category granularity. 


10.2.1 Map Categories onto a Line 

The first option finesses the “split or quit” dilemma but suffers from arbitrariness. According to
Van Valin, in the “continuum of [semantic roles], agent defines one end and patient the other; all
of the other [semantic roles] represent points along the continuum.”²⁰ He then presents (10.1) to
illustrate the positioning of his semantic roles:

(10.1) [Figure: a double-headed line running from Agent to Patient. Labeled points, in the order printed: Agent, Instrument, Locative, Patient; Source, Path, Goal, Recipient.]

This assumes that the roles lie in a one-dimensional space, but they may not. It also spaces them out
uniformly, although this sort of spacing may be a distortion of the data. Forbes has shown that there
are ways of determining whether textual data imply category positioning along a line and that there
are ways of assigning data-derived distances between linearly ordered categories.²¹ Techniques of
this sort have not yet been applied to semantic roles, to say nothing of the roles appropriate to
Biblical Hebrew.

19. This issue is discussed for lexical categories by William Croft, “Parts of speech as language universals and as
language-particular categories,” in Approaches to the Typology of Word Classes (ed. P. M. Vogel and B. Comrie; Berlin:
Mouton de Gruyter, 2000) 76–79. See also our §3.2 above.

20. Van Valin, “Functional Relations,” 151. For consistency with our usage, we have replaced Van Valin’s
“thematic relations” with our “semantic roles.”


10.2.2 Manual Approach: Split Maximally, Then Group Hierarchically 

In the second option, one introduces as many categories as seem meaningful—the more the 
merrier. One then uses text characteristics to group the categories. 

This is the option that is typically adopted and the one that we used. It has the virtue of sim- 
plicity but leads to an uneven taxonomy wherein the categories exhibit differing granularity. As 
the initial attempt at defining semantic roles, it is useful, but ultimately a rule-based computational 
approach may provide superior results. 

The manual/intuitive approach to hierarchical grouping of semantic roles is common in the
literature. For example, Van Valin presents the hierarchy of semantic roles shown in (10.2):²²


(10.2)
Thinker, Believer, Knower, Presumer        >  Cognizer   >  Experiencer(a)
Hearer, Smeller, Feeler, Taster            >  Perceiver  >  Experiencer
Giver, Runner, Killer, Speaker, Dancer     >  Agent
Located, Moved                             >  Theme
Broken, Destroyed, Killed                  >  Patient
Given to, Sent to, Handed to               >  Recipient

Highest level of the hierarchy: Actor, Undergoer

a. The idea, for example, is that the category of cognizer consists of thinker, believer, knower,
and presumer and that cognizer and perceiver together form the category of experiencer; and so on.

21. A. Dean Forbes, “Squishes, Clines, and Fuzzy Signs: Mixed and Gradient Categories in the Biblical Hebrew
Lexicon,” in Syriac Lexicography I: Foundations for Syriac Lexicography (ed. A. D. Forbes and D. G. K. Taylor;
Piscataway, NJ: Gorgias, 2006) 123–28.

22. Van Valin, “Functional Relations,” 154. See also Connolly, Constituent Order, 71–72, where a two-level
hierarchy is given.

When we intuitively group our semantic roles to form a hierarchy, we obtain this provisional
taxonomy:

Major Participants                Spatial                       Phrasal Discourse Units
  Doer                              Area                          Cause
  Experiencer                       Length                        Comparison
  Patient                           Location                      Deprivation
Other Participants                  Separation (“ablative”)       Quoter
  Accompanier                     Temporal                      Mixed-Level
  Agential                          Time aim/goal                 Aim / purpose
  Alternate / surrogate             Time interval                 “But-rather”
  Beneficiary                       Time origin                   Concessive
  Exocentric absolute               Time point                    Condition
  Harmed one                      Enriching Constituents          Reason
  Involved ones                     Cost                          Result
  Possessor                         Instrument                    Undesired outcome
  Ruled-over ones                   Manner
Movement                            Material / composition
  Movement aim (target)             Number count
  Movement bearing (direction)      Number of times
  Movement interval                 Reference
  Movement origin                   Resource / supply source
                                    Quantity / quantifier
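
As a cross-check on the provisional taxonomy, the groups can be written out as a plain mapping and tallied against the stated total of 44 semantic roles. This is only a sketch: the dictionary name `TAXONOMY` is hypothetical, and the first entries of the first two groups (doer and area) are assumptions inferred from the role lists given earlier in the chapter.

```python
# Provisional taxonomy as a mapping from group name to member roles.
# "doer" and "area" are assumed entries (see the lead-in above).
TAXONOMY = {
    "Major Participants": ["doer", "experiencer", "patient"],
    "Other Participants": [
        "accompanier", "agential", "alternate / surrogate", "beneficiary",
        "exocentric absolute", "harmed one", "involved ones", "possessor",
        "ruled-over ones",
    ],
    "Movement": [
        "movement aim", "movement bearing", "movement interval", "movement origin",
    ],
    "Spatial": ["area", "length", "location", "separation"],
    "Temporal": ["time aim/goal", "time interval", "time origin", "time point"],
    "Enriching Constituents": [
        "cost", "instrument", "manner", "material / composition", "number count",
        "number of times", "reference", "resource / supply source",
        "quantity / quantifier",
    ],
    "Phrasal Discourse Units": ["cause", "comparison", "deprivation", "quoter"],
    "Mixed-Level": [
        "aim / purpose", "but-rather", "concessive", "condition", "reason",
        "result", "undesired outcome",
    ],
}

all_roles = [role for members in TAXONOMY.values() for role in members]

# Every role should belong to exactly one group.
assert len(set(all_roles)) == len(all_roles)

print(len(all_roles))
```

With the two assumed entries included, the group sizes (3, 9, 4, 4, 4, 9, 4, 7) add up to the tally of 44 reported in §10.1.2.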


10.2.3 Algorithmic Approach: Cluster and Threshold 

An intuitively fashioned hierarchy may seem exhaustively and appropriately descriptive, but it 
ultimately suffers from not being directly and explicitly based on measured text characteristics (for 
proposed text characteristics, see §10.3.4). In any event, investigating algorithmic approaches is 
beyond the scope of this volume. 
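
For readers who want a concrete picture of "cluster and threshold," the following minimal sketch may help. It is not a method used in this volume: it applies plain single-linkage agglomerative clustering, the function name `cluster_roles` is hypothetical, and the distances between roles are invented for illustration rather than measured from any text.

```python
from itertools import combinations


def cluster_roles(distances, threshold):
    """Single-linkage agglomerative clustering with a stopping threshold.

    distances maps frozenset({a, b}) to a distance for every pair of roles.
    Merging stops once the two closest clusters are farther apart than threshold.
    """
    roles = sorted({role for pair in distances for role in pair})
    clusters = [frozenset([role]) for role in roles]

    def linkage(c1, c2):
        # Single linkage: the distance between the closest members.
        return min(distances[frozenset((a, b))] for a in c1 for b in c2)

    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2), key=lambda pair: linkage(*pair))
        if linkage(c1, c2) > threshold:
            break
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
    return sorted(sorted(c) for c in clusters)


# Invented distances between four roles (smaller means more alike).
toy = {
    frozenset(("time point", "time interval")): 0.2,
    frozenset(("location", "movement aim")): 0.3,
    frozenset(("time point", "location")): 0.8,
    frozenset(("time point", "movement aim")): 0.9,
    frozenset(("time interval", "location")): 0.85,
    frozenset(("time interval", "movement aim")): 0.95,
}

for cluster in cluster_roles(toy, threshold=0.5):
    print(cluster)
```

Raising the threshold merges more categories and lowering it splits them, so the threshold plays the part of the "split/quit" decision discussed in §10.2.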


10.3 Semantic Role Classification Criteria 


Once we have settled on a set of semantic roles and grouped them hierarchically, we must
devise methods of assigning semantic role categories to clause immediate constituents.

10.3.1 Representative Classification Criteria

Bornkessel et al. nicely summarize the challenges that we, as definers of semantic roles, face:

[T]here is still no fully satisfactory model of how the syntax-to-semantics linking is accomplished.
One reason for this appears to lie in the problems regarding the definition and scope of semantic
roles that have continually reappeared since the very beginnings of research in this domain. . . . For
example, researchers have vastly differed with regard to how many semantic roles should be assumed,
how these should be characterised both in semantic and in syntactic terms, how the different roles
should be dissociated from one another, and which syntactic phenomena should be derivable from
them. . . . From the earliest approaches to semantic roles and their interface character between syntax
and semantics, a central research focus has lain on defining the relation between these roles (both
individual and generalised) and their corresponding syntactic categories.²³


It is to the last mentioned topic that we now turn. What is the relation between the semantic roles 
and their corresponding syntactic categories? Or, putting the emphasis on classification, is it pos- 
sible to classify CICs on the basis of their syntactic properties? Much work has been done in this 
area. Unfortunately, the work that we are aware of presupposes fixed phrase order at some point, a 
presupposition we are not prepared to make. Work on semantic-role classification criteria falls into 
two areas: manual and computational. 

Manual classification is carried out by humans following analysis-constraining handbooks,²⁴ 
with research invariably done on a verb-by-verb basis. The resulting sets of semantic roles are 
dauntingly extensive. One benefit of having the manually classified semantic roles is that they can 
be, and are, used to enable and evaluate computational analyses.²⁵ 

Computational classification uses algorithms embodying standard methods of pattern recognition 
based on sets of features derived from the texts analyzed.²⁶ The goal of this analysis is to “infer 
role labels, given sentence constituents and a word from the sentence that is the predicator, which 
takes semantic arguments.”²⁷ 


10.3.2 The Complement-Adjunct Distinction 

In §7.3.4, we provided criteria for distinguishing complements and adjuncts taken from Pollard 
and Sag. We concluded that, given the limited size of our corpus, the criteria were of practical 
significance only for the frequently occurring verbs. In this subsection, we take up the distinction 
afresh, this time approaching it from the semantic role perspective. 

The FrameNet II formalism distinguishes three “levels of centrality” for semantic roles: core, 
peripheral, and extra-thematic.²⁸ These are defined as: 

* “A core [semantic role] is one that instantiates a conceptually necessary component of a 
frame, while making the frame unique and different from other frames. 

* “[Semantic roles] that do not introduce additional, independent or distinct events from the 
main reported event are characterized as peripheral. Peripheral [semantic roles] mark such 
notions as time, place, manner, means, degree, and the like. 

* “Extra-thematic [semantic roles] situate an event against a backdrop of another state of 
affairs, either an actual event or state of the same type . . . or by evoking a larger frame 
within which the reported state of affairs is embedded.”²⁹ 


23. Ina Bornkessel et al., eds., Semantic Role Universals and Argument Linking: Theoretical, Typological, and 
Psycholinguistic Perspectives (Berlin: de Gruyter, 2006) 1-3. 

24. Two well-developed manual projects are Ruppenhofer et al., FrameNet II; and Martha Palmer, Daniel Gildea, 
and Paul Kingsbury, “The Proposition Bank,” Computational Linguistics 31 (2005) 71-106. 

25. Having labeled instances of the semantic roles allows algorithms to select and weight syntactic features using 
the standard methods of statistical pattern recognition. (See “supervised learning” in any book on pattern recognition.) 

26. See Gildea and Jurafsky, “Automatic Labeling”; Cynthia A. Thompson, Roger Levy, and Christopher D. Manning, 
“A Generative Model for Semantic Role Labeling,” Proc. ECML-2003; Kristina Toutanova et al., “Joint Learning 
Improves Semantic Role Labeling,” Proc. 43rd Meeting of ACL; Trond Grenager and Christopher D. Manning, 
“Unsupervised Discovery,” in Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. 

27. Thompson, Levy, and Manning, “A Generative Model,” 1. 

28. Although the FrameNet II terminology implies a particular positioning, the definitions of terms do not. Hence, 
the distinctions are relevant to both fixed and free phrase order languages. 


Within this framework, Manning and co-workers propose a useful distinction between arguments 
and adjuncts: “while core arguments must be associated with a semantic role that is verb specific 
. . . , [peripheral] adjuncts are generated by a role that is verb independent.”³⁰ 

Palmer et al. provide a table of “general, adjunct-like arguments.”³¹ Deleting three pseudo-roles,³² 
we present their table 1, which looks like this: 


loc: location            tmp: time
ext: extent              pnc: purpose [sic]
adv: general purpose     mnr: manner
cau: cause               dir: direction


10.3.3 Linking Functions to Constructions 

By way of reminder, the daughters of the clause root node are the clause immediate constituents 
(CICs; see §5.1). Each CIC dominates a daughter node, its label specifying a construction. The 
exercise in this subsection is to “link” the CIC function labels to the CIC daughter construction 
labels, that is, to see how the function and form labels correlate. Two examples will make this 
notion concrete. 

Invariable Linking. Let us first examine a situation in which a construction is invariably linked 
to a particular CIC. Consider the construct phrase נאם־יהוה ‘oracle of Yahweh’ (236x). Its form is 
n:vocbl/cstr. Its semantics (vocable) make it eligible to label a specimen of prophetic text, and so 
the form is, with one exception,³³ immediately dominated by a lbl/gram CIC node, as in phrase 
marker (10.3): 

(10.3) [Phrase marker: a lbl/gram CIC node immediately dominating the n:vocbl/cstr construct phrase ‘oracle of’ + ‘Yahweh’.] 


Multiple Linkings. The prepositional phrase לדוד or לדויד ‘to David’ occurs 147 times in 
Biblical Hebrew, 85 times as a CIC daughter. The linking question is: “What grammatical functions 
(GFs) and semantic roles (SRs) does this construction exercise and under what circumstances?” 
(10.4) [Phrase marker: a CIC node with undetermined labels (GF? / SR?) dominating the to + humn / prep phrase ‘to’ + ‘David’.] 


29. A (semantic) frame is “a script-like conceptual structure that describes a particular type of situation, object, or 
event along with its participants and props” (Ruppenhofer et al., FrameNet II, 5). 

30. Grenager and Manning, “Unsupervised Discovery,” 3. Also Kristina Toutanova et al., “Joint Learning,” 590. 

31. Palmer, Gildea, and Kingsbury, “The Proposition Bank,” 76-77. 

32. The deleted items are: DIS: discourse connective, NEG: negative, MOD: modal. 

33. In Isa 31:9, the construct phrase is in apposition to a nominalized clause, the combined structure being a label 
CIC. 
The phrase is linked to either of two grammatical functions (ind obj [42x] and dir obj [1x]) or 
any of six semantic roles (refrnt [9x], lbl [14x], poss [13x], benf [4x], mvt aim [1x], and mvt dir 
[1x]). Phrase marker (10.5) shows a typical ind obj instance from 1 Chr 11:5: 


(10.5) [Phrase marker for 1 Chr 11:5 (cl/oblq). CICs in order: ‘and’; vb/gram ‘they(m) said’; ‘dwellers of’ + ‘Yebus’; ind obj/gram, to + humn / prep, ‘to’ + ‘David’; obj addr/gram, cl/oblq, ‘not’ + ‘thou(m) wilt come’ + loc/gram ‘here’.] 


The predicator is יאמרו ‘they(m) said’. As we will see in §12.4.3.5, if the predicator root is אמר, 
then a ל + <human> prepositional phrase is an ind obj 97.3% of the time. 

The single dir obj is shown in (10.6) from 1 Chr 18:6, where the verb is a Hiphil of ישע:³⁴ 
(10.6) [Phrase marker for 1 Chr 18:6 (cl/oblq). Glosses in order: ‘and’; ‘he saved’; sbj ‘Yahweh’; ‘David’ (the dir obj discussed in the text); ‘all of’ + ‘which’ + ‘he walked’.] 


Our phrase is a label (lbl) CIC 14 times, always in a psalm, as in (10.7) from Ps 26:1: 

(10.7) [Phrase marker for Ps 26:1 (cl/oblq): a lbl/gram CIC dominating the to + humn / prep phrase ‘to’ + ‘David’.] 


34. Overall in ישע clauses, the phrase לדוד occurs twice as dir obj (Ps 86:16 and 1 Chr 18:6). The direct object is 
marked instead with an object marker in very similar passages in 2 Sam 8:6, 14; and 1 Chr 18:13. 


The possessor role occurs 13 times. In phrase marker (10.8) for 1 Chr 18:13, the predicator is יהיו 
‘they-were’. About 40% of the ל + <human> phrases in היה clauses are possessors (poss). 


(10.8) [Phrase marker for 1 Chr 18:13 (cl/oblq). CICs in order: dl seq w ‘and’; vb/gram ‘they(m) were’; sbj/gram, n:geog, ‘all of’ + ‘Edom’; sbj cmp/gram ‘servants’; poss/gram, to + humn / prep, ‘to’ + ‘David’.] 


We find four places³⁵ where לדוד has the semantic role beneficiary (benf). In phrase marker (10.9) 
from 2 Sam 5:11, the clause predicator is יבנו ‘they built’ with root בנה. 


(10.9) [Phrase marker for 2 Sam 5:11 (cl/oblq). CICs in order: dl seq w ‘and’; vb/gram ‘they(m) built’; dir obj/gram ‘house’; benf/gram, to + humn / prep, ‘to’ + ‘David’.] 


Approximately half (54.5%) of the ל + <human> phrases in בנה clauses function as beneficiaries 
(benf). 

The semantic role movement aim (mvt aim) is seen once, in 1 Chr 12:17, phrase marker (10.10). 
The predicator root is בוא ‘to come’. In clauses having this root, phrases of the shape ל + <human> 
have movement aim (mvt aim) as their function almost two-thirds of the time (63.5%). 


35. These are: 2 Sam 5:11; 1 Kgs 11:38; Jer 22:4, and 33:15. 
(10.10) [Phrase marker for 1 Chr 12:17 (cl/oblq). CICs in order: dl seq w ‘and’; ‘they(m) came’; sbj/gram, from + humn / prep, ‘from’ + ‘sons of’ + ‘Binyamin’ + ‘and’ + ‘Judah’; mvt aim/gram, until + bldg / prep, ‘until’ + ‘to’ + ‘the’ + ‘stronghold’; mvt aim/gram, to + humn / prep, ‘to’ + ‘David’.] 


We have seen that grammatical function and semantic role are not necessarily intrinsic to syn- 
tactic form as such. Rather GF/SR correlate with the semantics of the verbs with which they are 
associated. 
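
The dependence of role on predicator can be made concrete with the figures quoted above. The following Python sketch is illustrative only: the dictionary layout and the function name `reported_role` are assumptions introduced here, the predicators are keyed by their English glosses, and the table holds just the single percentage that the text reports for each root rather than a full distribution.

```python
# Figures quoted in the running text: for each predicator (keyed here by its
# English gloss), the role that a "to + <human>" phrase most often carries and
# the reported share. One figure per root; this is not a full distribution.
REPORTED_SHARE = {
    "say":   ("ind obj", 97.3),
    "be":    ("poss", 40.0),      # the text says only "about 40%"
    "build": ("benf", 54.5),
    "come":  ("mvt aim", 63.5),
}

# Tally of the 'to David' phrase as a CIC daughter, as listed in the text.
TO_DAVID_TALLY = {
    "ind obj": 42, "dir obj": 1,            # grammatical functions
    "refrnt": 9, "lbl": 14, "poss": 13,     # semantic roles
    "benf": 4, "mvt aim": 1, "mvt dir": 1,
}


def reported_role(predicator_gloss):
    """Return (role, percent) if the text reports a figure for this predicator."""
    return REPORTED_SHARE.get(predicator_gloss)


total = sum(TO_DAVID_TALLY.values())
print(total)                    # should match the 85 CIC-daughter instances
print(reported_role("build"))   # role and share reported for 'build' clauses
```

The same surface form thus receives a different most-likely label depending on the predicator, which is why predicator identity appears among the classification features discussed below.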


10.3.4 Proposed Semantic Role Syntactic Correspondences 


Important work has been done on describing “the relation between semantic roles and their 
syntactic realization . . . [the goal being] . . . to learn to recognize semantic relationships from 
syntactic cues, given examples with both types of information.”³⁶ What is being described here is 
called “supervised learning.” By having a subset of one’s data marked up with both the independent 
and dependent categories of interest,³⁷ one can exploit standard methods of pattern recognition to 
infer rules for assigning the dependent categories, given the independent categories. Typically, the 
inferred rules are probabilistic, and one is engaging in statistical pattern recognition. 
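
As a minimal sketch of what such supervised inference involves, the Python fragment below estimates role probabilities by simple relative frequency over labeled examples. It is not the method of any of the works cited; the function names `train` and `predict`, the feature triple (preposition gloss, nominal type, predicator gloss), and the seven labeled examples are all hypothetical and serve only to show the mechanics.

```python
from collections import Counter, defaultdict


def train(examples):
    """Count how often each role occurs with each feature tuple.

    examples: iterable of (features, role) pairs; features must be hashable.
    """
    counts = defaultdict(Counter)
    for features, role in examples:
        counts[features][role] += 1
    return counts


def predict(counts, features):
    """Return (most frequent role, estimated probability), or None if unseen."""
    seen = counts.get(features)
    if not seen:
        return None
    role, n = seen.most_common(1)[0]
    return role, n / sum(seen.values())


# Hypothetical teaching data: (preposition, nominal type, predicator) -> role.
teaching = [
    (("to", "human", "say"), "ind obj"),
    (("to", "human", "say"), "ind obj"),
    (("to", "human", "say"), "ind obj"),
    (("to", "human", "say"), "refrnt"),
    (("to", "human", "build"), "benf"),
    (("to", "human", "build"), "poss"),
    (("to", "human", "build"), "benf"),
]

model = train(teaching)
print(predict(model, ("to", "human", "say")))
print(predict(model, ("in", "pron", "send")))  # unseen combination
```

A realistic classifier would smooth the estimates and back off to less specific feature combinations when a tuple is unseen; the sketch simply returns None in that case.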

In discussing syntactic features, we use a clause from Deut 7:20 for phrase marker (10.11): 


36. Gildea and Jurafsky, "Automatic Labeling," 247 and 252. 

37. For our approach to the problem of inferring semantic roles from syntactic information, the former are the 
dependent and the latter are the independent categories. Interestingly, Van Valin specifies algorithms for moving from 
syntax to semantics (§7.0.2) or from semantics to syntax (§7.0.1): R. D. Van Valin Jr., Exploring the Syntax-Semantics 
Interface (Cambridge: Cambridge University Press, 2005) 225-28. 

38. See, for example, Richard O. Duda et al., Pattern Classification (2nd ed.; New York: Wiley-Interscience, 2000). 
(10.11) Phrase marker for Deut 7:20 (cl/oblq), listed by clause immediate constituent: 

1. dl and / 1st in cl: ‘and’ 
2. gam / gram: ‘also’ 
3. dir obj / gram: obj mk + anml / prep, ‘[nota acc.]’ + ‘the’ + ‘hornet’ 
4. vb / gram: ‘he will send’ 
5. sbj / gram: apposition phrase (n:deity), ‘Yahweh’ + ‘gods’ + ‘thee(m)’ 
6. loc / gram: in + pron / prep, ‘in’ + ‘them(m)’ 
7. tm aim / gram: until + inf cstr / prep, ‘until’ + nominalized infinitive clause (nom inf / oblq) ‘perish’, with sbj / gram ‘the left ones and the hidden ones’ and mvt orig / gram (from + body prt / prep) ‘from’ + ‘face’ + ‘thee(m)’ 


We examine GF/SR assignments for this clause considering the final three of six syntactic features 
proposed by Gildea and Jurafsky:³⁹ 

1. Phrase type. “Different roles tend to be realized by different syntactic categories” 
(p. 252). In phrase marker (10.11), the first CIC is determined by its phrase type, or rather 
by segment identity in this case (dl and / 1st in cl), as are the second CIC (gam/gram) and 
the fourth CIC (vb/gram). The grammatical function of the third CIC is determined by 
its phrase type because an obj mk + anml / prep node in a clause with an active verb is, 
with very few exceptions, a dir obj/gram CIC. For its semantic role, one might assign 
undergoer.⁴⁰ 

2. Voice. “The distinction between active and passive verbs plays an important role in the 
connection between semantic role and grammatical function” (p. 257). We will make use of 
this distinction. 

3. Semantics of head word.⁴¹ “[W]e expected lexical dependence to be extremely important 
in labeling semantic roles, as indicated by their importance in related tasks such as 
parsing” (p. 258). 


39. Regarding the first three features, Gildea and Jurafsky (“Automatic Labeling,” 262) comment: “Three of our 
features, position, [governing category], and path, attempt to capture the syntactic relation between the [predicator] and 
the constituent to be labeled, and in particular to differentiate the subjects from objects of verbs.” 

40. Van Valin, Exploring, 54. 


Our preference with regard to prepositional phrases is to use as features the identity of the 
preposition and the grammatical features of the associated nominal. Consider the seventh CIC in phrase 
marker (10.11). The preposition is עד ‘until’, and the nominal is a nominalized infinitive construct 
(nom inf/oblq). Searching our phrase markers, we find that there are 143 instances of a CIC that 
is a prepositional phrase consisting of עד ‘until’ plus a nominalized infinitive construct predicator 
(nom inf/oblq). All but 11 of these (92.3%) have their dominating CIC-node classified as a tm 
aim/gram. Hence, a rule stating that עד ‘until’ plus a nominalized infinitive construct has a time 
aim semantic role will be correct approximately 92% of the time.⁴² 
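
The arithmetic behind this rule can be checked in a few lines of Python. The counts are those given in the text; the function name `rule_accuracy` is a hypothetical helper introduced only for this illustration.

```python
def rule_accuracy(total, exceptions):
    """Percentage of instances for which a single-label rule is correct."""
    return 100.0 * (total - exceptions) / total


TOTAL_INSTANCES = 143   # 'until' + nominalized infinitive construct CICs
EXCEPTIONS = 11         # instances not classified as tm aim

accuracy = rule_accuracy(TOTAL_INSTANCES, EXCEPTIONS)
print(f"{accuracy:.1f}%")
```

Any deterministic rule of this kind is only as good as the share of the majority label, which is why less sharply skewed constructions call for additional features.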

Other feature sets have been suggested. Grenager and Manning seek to “learn a model which 
relates a verb, its [accompanying] semantic roles, and their possible syntactic realizations.”⁴³ To 
achieve this, they choose to “number the noun phrase (np), complement clause (cl, xcl), and adjectival 
complements (acmpl) appearing in an unbroken sequence directly after the verb, since this is 
sufficient to capture the necessary syntactic information.” This approach strikes us as overly 
English-centric. Thompson, Levy, and Manning opted for a stripped-down set of features: “To simplify 
the model, we chose to represent each constituent by its phrasal category together with the head 
word of that constituent.”⁴⁴ Their approach involves less work than more-expansive approaches. 

Regarding the phrase marker (10.11) for Deut 7:20, we have discussed inference of grammatical 
functions and semantic roles for all but the fifth CIC and the sixth CIC. The fifth CIC is syntacti- 
cally a masculine-singular apposition phrase having divine semantics. Because the noun phrase and 
predicator agree in gender and number, the noun phrase here is a candidate for filling the subject 
grammatical function (with semantic role of, say, doer). 
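
The concord test just described amounts to comparing two feature pairs. The Python sketch below is a hypothetical illustration: the `Morph` record, the function names, and the second (non-agreeing) noun phrase are assumptions made for the example; only the masculine-singular values for the verb and the apposition phrase come from the discussion above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Morph:
    gender: str   # "m" or "f"
    number: str   # "sg" or "pl"


def agrees(noun_phrase, verb):
    """Noun-verb concord: agreement in both gender and number."""
    return noun_phrase.gender == verb.gender and noun_phrase.number == verb.number


def subject_candidates(noun_phrases, verb):
    """Names of the noun phrases that agree with the predicator."""
    return [name for name, morph in noun_phrases if agrees(morph, verb)]


verb = Morph("m", "sg")  # 'he will send'
noun_phrases = [
    ("Yahweh + gods + thee(m)", Morph("m", "sg")),   # the fifth CIC
    ("hypothetical plural phrase", Morph("m", "pl")),  # invented contrast case
]

print(subject_candidates(noun_phrases, verb))
```

Concord only licenses a noun phrase as a subject candidate; where several phrases agree with the verb, further features are needed to choose among them.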

Finally, consider the sixth CIC in phrase marker (10.11) which is shown as a loc/gram semantic 
role label dominating an in+pron/prep constituent. How would one use the rather sparse syntactic 
facts to infer the semantic role label probabilistically? The syntactic facts here are these: 

* the constituent is a CIC, 

* it is a prepositional phrase, 

* its preposition is ב ‘in’, 

* its nominal is the pronoun ם ‘them’. 


41. The “head” is “the central element which is distributionally equivalent to the phrase as a whole” (D. Crystal, A 
Dictionary of Linguistics and Phonetics [5th ed.; Oxford: Blackwell, 2003] 215). 

42. Eight of the 11 oddities have been classified as time intervals. The other 3 are in Judg 6:4, 2 Sam 5:25, and 
1 Kgs 18:46, and all involve an infinitive construct having the root בוא. A further instance of this sort, where the 
construction leads off a conjoined (union) phrase, may be found in 1 Sam 17:52. 

43. Grenager and Manning, “Unsupervised Discovery,” 2. 

44. Thompson, Levy, and Manning, “A Generative Model,” 6. 


An attempt to assign CIC labels purely probabilistically relies on observed ranked frequency of 
occurrence. For the current instance, if we search the phrase markers of Biblical Hebrew for 
prepositional phrases that are CICs consisting of “in” plus a pronoun, we find 1,349 instances distributed 
across 5 grammatical functions and 16 semantic roles. The distribution of the 4 most frequent (plus 
a residuum other) is as follows: 


CIC Type          Incidence    Percentage
location                598          44.3
direct object           451          33.4
instrument              128           9.5
harmed one               47           3.5
other                   125           9.3
Total                 1,349         100.0


The table suggests that if we encounter an in+pron/prep constituent, then there is around a 50-50 
chance that it is a location, with 3 chances in 10 that it is a direct object, and so on. For the present 
phrase marker, however, we know even more. Since the presence of את ‘[object marker]’ elsewhere in 
the clause identifies a direct object CIC, we know that the constituent being classified most likely is 
not a direct object. Deleting that near-impossibility from the numbers yields this table of counts and 
adjusted percentages: 


CIC Type          Incidence    Percentage
location                598          66.6
instrument              128          14.3
harmed one               47           5.2
other                   125          13.9
Total                   898         100.0


Given that we are not dealing with a direct object CIC, we know that the questioned constituent has 
a likelihood of two chances in three of being a location. The next most likely semantic role, instrument, 
has around one chance in seven of being correct. To improve the odds for correct assignment, 
one would need further information. Alternatively, one might simply declare the top two or three 
most likely to be available and have the over-reader decide among them.⁴⁵ 
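
The adjustment just described is a renormalization of the counts after one category has been ruled out. The Python sketch below reproduces it from the counts given in the text; the function names `renormalize` and `top_k` are hypothetical, and the decision to leave the residual “other” bucket out of the ranked suggestions is an assumption made for this illustration.

```python
# Counts for CICs of the shape "in" + pronoun, as given in the text.
COUNTS = {
    "location": 598,
    "direct object": 451,
    "instrument": 128,
    "harmed one": 47,
    "other": 125,
}


def renormalize(counts, excluded):
    """Drop the excluded categories and recompute percentages over the rest."""
    kept = {label: n for label, n in counts.items() if label not in excluded}
    total = sum(kept.values())
    return {label: round(100.0 * n / total, 1) for label, n in kept.items()}


def top_k(percentages, k, skip=("other",)):
    """The k most likely named categories, most likely first."""
    named = [(label, p) for label, p in percentages.items() if label not in skip]
    named.sort(key=lambda pair: pair[1], reverse=True)
    return [label for label, _ in named[:k]]


adjusted = renormalize(COUNTS, excluded={"direct object"})
print(adjusted)
print(top_k(adjusted, 2))  # candidates to offer the over-reader
```

Offering the top two or three candidates in this way trades a single uncertain label for a short list that a human reader can resolve.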

This study of the role assignment possibilities for a single clause is in no way definitive. But it 
does provide insight into the way a more extensive study might proceed, and the results that might 
be achieved.⁴⁶ 

45. When we compare our present parsing of Deut 7:20 with the parsing of Deut 28:20 and 1 Sam 24:7, we see 
that our probabilistic assignment of SR to the sixth CIC in Deut 7:20 is likely incorrect: harmed one appears superior 
to location. 

46. Technical note. In the exercise just completed, we obtained the needed probability estimates by surveying 
assignments made across all of Biblical Hebrew. In actual practice, to avoid circularity, one would manually classify 
a carefully designed subset of data (“the teaching data”) and then use probability estimates based on outcomes in the 
teaching data to classify previously unseen data (“the testing data”). 


Given the foregoing proposed semantic role/syntactic form correspondences and given the 
attributes of Biblical Hebrew, one should attempt both to infer grammatical functions and to assign 
semantic roles to clause immediate constituents on the basis of observable characteristics of clausal 
constituents such as these:⁴⁷ 


1. Predicator: identity (“root”), binyan, voice, gender, number, semantics 
2. Noun Phrases: gender, number, semantics 
3. Prepositional Phrases: preposition identity, noun phrase semantics 
4. “Certain parts of speech” 
5. “Certain set phrases” 


“Certain parts of speech” points to the fact, for example, that modals always are assigned the modal 
operator function. By “certain set phrases,” we refer to constituents such as נאם־יהוה ‘oracle of 
Yahweh’, a phrase that is invariably a label CIC. 


10.3.5 The Way Forward 

We have elected to attempt neither computational classification of the clause immediate constituents 
into the five groups introduced in chap. 9 nor computational assignment of semantic roles 
from the list in §10.1.2. These are challenges for another day, for us or for others.⁴⁸ Instead, in the 
next seven chapters, we consider the computer- and expert-assigned CIC labels to be givens and 
proceed as follows: 


Chapter 11. We introduce methods of displaying the composition, incidence, and ordering of 
CIC types. 

Chapters 12-15. We then characterize the verb corpora defined by these four high-frequency 
verb roots: אמר ‘say’, היה ‘be’, עשה ‘do, make’, and נתן ‘give’. 

Chapter 16. Next, we investigate the way that each CIC subtype is realized across all 
clauses having finite verbs, independent of verb root. 

Chapter 17. We conclude our investigation of CICs by presenting a computational method 
for discovering sets of verb roots that naturally group together. Clustering verb corpora 
into groups allows us elegantly and informatively to represent the characteristics that they 
share, facilitating important generalizations. 


10.4 Brief Summary 


The semantic role of a CIC discloses “the semantic relation that a nominal bears to the rest of 
the clause" (Whaley). Analyses differ with regard to repertoire, taxonomy, and recognition criteria. 

Semantic Role Repertoire. After examining the SR repertoires posited by various authors, we 
have arrived at a set of 44 SRs, many novel to our work. 


47. Including predicator gender / number and noun-phrase gender / number in our list of characteristics introduces a 
new quality into the mix: noun-verb concord. Noun-verb concord is present when a noun (phrase) and verb agree both 
in gender and number, allowing the noun (phrase) to be the subject of the clause under analysis. 

48. The creation and assessment of carefully reasoned taxonomies is quite involved. Assessing the adequacy of our 
taxonomies of parts of speech and of semantic roles is an iterative process. 
Semantic Role Taxonomy. As for our SR taxonomy, we have followed one standard practice 
and intuitively grouped our SRs into a hierarchy with eight top levels: major participants, other 
participants, movement, spatial, temporal, enriching, phrasal discourse units, and mixed-level. 

Semantic Role Recognition Criteria. As we demonstrate, GF/SR classes are not necessarily 
intrinsic to specific syntactic forms; the characteristics of the clausal predicator are also critical for 
the classifications. 

Various sets of constituent features have been proposed for computationally inferring GFs/SRs. 
We provide an example that illustrates how this sort of detailed computational inference might 
work but have not yet investigated the matter. 

The present work takes as given the computer-assigned and human-adjusted GF/SR CIC label 
assignments and uses these to investigate CIC behaviors. 


Chapter 11 


Introduction to 
Clause Immediate Constituent Composition, 
Incidence, and Ordering 


In this chapter, we take up CIC type characterization and the quantitation of CIC incidence and 
ordering. We introduce ways of describing the CICs in the clauses containing a given root (§11.1). 
We next produce CIC-Incidence vertical bar charts (§11.2). Finally, we study CIC ordering, displaying 
pairs of CIC-Ordering horizontal bar charts (§11.3). The purpose of this chapter is to illustrate 
our methods using the small corpus of clauses having the root חפץ ‘desire’ or ‘delight in’. 


11.1 Clause Immediate Constituent Censuses 


11.1.1 The Binyan Census 

In upcoming chapters, we will first provide counts of the clauses with realizations of various 
binyanim. The binyan census for חפץ is simplicity itself: the verbal stock for חפץ consists of 78 
Qal actives.¹ 


11.1.2 The CIC Subtype Census 

The tally of CIC subtypes is provided as a standardized table similar to table 11.1 for חפץ. The 
major table headings are the five CIC subtypes introduced in §9.1 with the final subtype split into 
its two components. In the left two-thirds of the table, the counts specify how many times each 
constituent occurs as a non-GF/SR CIC. In the right-most third, we tally the GFs plus the six most 
frequent SRs. Based simply on the table, one may advance a few generalizations regarding the חפץ 
corpus: (1) as presently analyzed, the verb חפץ is transitive; (2) only 27% of the clauses have an 
explicit subject; (3) חפץ appears with negation 23% of the time; (4) its clauses do not involve 
interrogatives; (5) not very many spatial or temporal specifications appear, but aim CICs do appear. To 
help make these concepts more concrete, consider phrase marker (11.1) from Ezek 18:23 (p. 153). 
The label CIC contributes a count of one to the label entry in the left-most panel of the table. The 
חפץ predicator adds a count of one to the finite verb entry in the middle panel. The nonpredicative 
infinitive absolute supplies a count of one to its category in the middle panel, and the closed 
interrogative is the sole instance of its category. The remaining CIC (dir obj) is a GF. It contributes 
a count of one to the direct object GF entry in the right-most panel of the table.² 
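
The bookkeeping described in this paragraph is a simple tally over CIC labels. The Python sketch below shows the idea for the single clause of phrase marker (11.1); the function name `census` and the representation of a clause as a list of label strings are assumptions made for this illustration, not a description of the authors' software.

```python
from collections import Counter


def census(clauses):
    """Tally CIC categories over a collection of clauses.

    Each clause is given as a list of CIC category labels; every CIC
    contributes a count of one to its category.
    """
    tally = Counter()
    for cic_labels in clauses:
        tally.update(cic_labels)
    return tally


# CIC categories of Ezek 18:23 as described in the paragraph above.
ezek_18_23 = [
    "closed interrogative",
    "non-predicative inf. abs.",
    "finite verb",
    "direct object",
    "label",
]

tally = census([ezek_18_23])
for category, count in tally.items():
    print(f"{category}: {count}")
```

Running the same tally over all 78 clauses of a corpus would yield the kind of counts that a census table such as table 11.1 reports.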


1. We never include clauses with imputed elliptic predicators in our counts (see §20.2.3). For the present root, there 
is a clause with an ellipted predicator in Hos 6:6. This clause is not included in our tallies. Once, in Job 40:17, a חפץ verb 
in a clause has the sense “cause to bend down.” This clause is also excluded from our analyses. 

2. In our tallies, we include resumed constituents, since these are genuine clause constituents. (For the חפץ corpus, 
the single resumed CIC is a direct object in Isa 13:17.) 
Table 11.1. CIC Subtype Census for the חפץ Corpus: 78 Structures 

Impermanents           Count | Predicators                 Count | GFs               Count
Underspecified “and”       7 | Finite verb                       | Subject
Lapsus calami              0 | Predicative inf. abs.             | Direct object
Nebulous                   0 | Purely verbal participle          | Indirect object
Syntactic Isolates     Count | Noun-verb participle              | Complement
Vocative                   2 | Infinitive construct              | Top Six SRs
Label                      2 |                                   |
Exclamative                0 | Non-predicative inf. abs.ᵃ        | Aim / purpose
                             | Closed interrogative              | Time point
Other conjunctionᵇ         0 |                                   | Comparison
                             | Modal                             | Accompanier

a. To save some space, we include the aspectualizers (“nonpredicative infinitive absolute”) in with the predicators. 
b. “Other conjunction” includes all CIC-level conjunctions other than those classified as underspecified “and.” 
Examples: גם ‘also’, אף ‘also’, כי ‘that’, או ‘or’. 


(11.1) [Phrase marker for Ezek 18:23 (cl/oblq). CICs in order: q closed / gram ‘yes/no’; inf abs int / gram ‘to desire’; vb / gram ‘I will desire’; dir obj / gram, nom inf / oblq, ‘die’ with sbj / gram ‘wicked’; lbl / gram, n:vocbl / cstr, ‘oracle of’ + ‘lords’ + ‘me’ + ‘Yahweh’.] 


11.1.3 Survey of the Non-GF/SR Clause Immediate Constituents 



While the primary focus of our examination of verb corpora is the composition, incidence, and 
ordering of the GF/SR items, for each verb corpus we provide a two-part survey of its non-GF/SR 
CICs. At each section beginning, we provide a summary of the main points of the material to be 
found in the remainder of the section. The main body of the survey then follows. The summary 


looks like this: 
§11.1.3 Main Observations. We note that the חפץ corpus attracts very few syntactic 
isolates, that in almost all חפץ clauses the predicator is a finite verb (96%), and that almost 
always negatives immediately precede the חפץ verb (94%) and are mostly clause 
initial (67%). 


11.1.3.1 Impermanents 
The seven underspecified “and” CICs represent unfinished work: CICs that one day will be 
promoted to discourse connectives (“cue phrases”; see §9.3.1.1). 


11.1.3.2 Syntactic Isolates 

There are two vocatives (clause initial in 2 Sam 24:3 and clause medial in Ps 40:9), and there 
are two labels (clause final in both Ezek 18:23 and 32; see §9.3.2). 

11.1.3.3 Predicators 

Noun-verb participles almost always precede their arguments and are viewed as anchored. The 
חפץ corpus has three: Ps 34:13, 35:27; and Neh 1:11. In Ezek 18:23 (see phrase marker [11.1] 
immediately above), we have a cognate intensifier aspectualizer. In Job 13:3, we have a noncognate 
amplifier aspectualizer (see §3.2.6.1). 


11.1.3.4 Operators 

A single closed interrogative ה ‘yes or no?’ appears in Ezek 18:23, and a single modal לו 
‘would that’ occurs in Judg 13:23. Of the 18 negatives, 17 immediately precede their finite verb. 
The sole exception is in Ps 147:10, where the CIC sequence is neg . . . dir obj . . . pred (“not in the 
strength of the horse he will desire”). The negative CIC is clause initial in 12 clauses (67%). 


11.1.4 Survey of the GF/SR Clause Immediate Constituents 

The relation between the GF/SR functions and their forms, as revealed by the makeup of the 
CIC daughters, is a major focus of this and the following four chapters. For each verb corpus we 
give a two-part survey of its GF/SR CIC daughters. At each section beginning, we give a summary 
of the main points of the material to be found in the remainder of the section. The main body of the 
survey then follows. The summary looks like this: 


§11.1.4 Main Observations. The חפץ corpus includes very few adverbs and no 
clause(-like) structures. Its direct objects are often realized as prepositional phrases with 
ב (62%). The corpus includes an aim semantic role fairly often (20% of clauses), predominantly 
as a prepositional phrase with ל (87%). Its bare substantive constituents 
almost always exercise subject or direct object grammatical functions (95%). 


Table 11.2 (p. 155) tallies the forms that appear as GF/SR daughters in the חפץ corpus. 


11.1.4.1 Adverbs 

When describing the clause sets of frequently occurring verbs, we use tables of tallies to present 
the census data. With the חפץ corpus, however, the CIC daughter adverbs are so few that we simply 
list them, their semantic roles, and their citations. 
Table 11.2. GF/SR Daughter Forms 

GF/SR Daughter             Count
Adverb                         4
Clause(-like) struct.ᵃ         0
Prepositional phrase          54
Noun phrase                   37
Pronoun                        3

a. “Clause(-like) struct.” refers to clauses, sentences, and primitive discourse structures. 


Adverb          Semantic Role       Citation
הנה ‘here’      location            1 Sam 18:22
מאד ‘very’      manner [degree]     1 Sam 19:1
אז ‘then’?      time point          Ps 51:21
מאד ‘very’      manner [degree]     Ps 112:1


11.1.4.2 Clause(-Like) Structures 

The חפץ verb corpus contains no CIC daughter clause(-like) structures. 

11.1.4.3 Prepositional Phrases 

There are 54 CIC daughter prepositional phrases in the חפץ corpus. The preposition incidences 
are: ב (35x), ל (15x), and four prepositions that occur once each. 

In studying the linkages between GF/SR functions (as disclosed by their CIC labels) and GF/SR 
forms (as given by the CIC daughter labels), our policy for presenting data tallies is as follows. The 
data tables in the text present as individually identifiable counts only the category combinations that 
have at least some cell counts in excess of 5% of the total number of instances under investigation. 
Smaller counts are consolidated under “[other].” This is intended to allow readers to grasp the ma- 
jor characteristics of the data without having to wade through minutiae. 
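
The consolidation policy can be sketched as follows. This Python fragment is a simplified, one-dimensional illustration (the tables in the text apply the policy to category combinations in two dimensions); the function name `consolidate`, the category names, and the counts are all hypothetical. A count is shown individually when it is at least 5% of the total, which for whole-number counts gives the same result as requiring it to exceed 5% whenever the threshold itself is not a whole number.

```python
def consolidate(counts, fraction=0.05):
    """Keep counts at or above the threshold; pool the rest under '[other]'.

    Returns (threshold, displayed), where threshold = fraction * total.
    """
    total = sum(counts.values())
    threshold = fraction * total
    displayed = {label: n for label, n in counts.items() if n >= threshold}
    pooled = sum(n for n in counts.values() if n < threshold)
    if pooled:
        displayed["[other]"] = pooled
    return threshold, displayed


# Hypothetical counts for six categories, totalling 35 instances.
hypothetical = {"A": 17, "B": 9, "C": 5, "D": 2, "E": 1, "F": 1}

threshold, displayed = consolidate(hypothetical)
print(threshold)
print(displayed)
```

With 35 instances the threshold is 1.75, so in this invented example the two singleton categories are pooled and the other four are displayed individually.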

The foregoing will be made clearer through examination of the tables for the frequent prepositional 
phrase types (PP types) of the חפץ corpus. In this and the next four chapters, readers should 
examine the tables, seeking generalizations that characterize the various verb corpora. 

The ב Prepositional Phrases. The censoring threshold is 5% of the total number of PPs being 
considered, 35 in the case of ב. Hence, only the category combinations that have counts greater 
than or equal to (.05 x 35) = 1.75 are displayed in their own cells in table 11.3. 

Table 11.3. ב PPs for חפץ

substantive type    dir obj    [other]
[pronoun]
quality
human
abstract
document
inf. construct
[other]



We observe for the חפץ corpus that: (1) any PP involving ב ‘in’ is most likely a direct object
(94.3%); (2) around one-third of the direct objects involve ב plus a pronoun (17/52, 32.7%).

The ל Prepositional Phrases. Table 11.4 provides tallies for the 15 ל PPs in the חפץ corpus.


Table 11.4. ל PPs for חפץ

substantive type    aim    dir obj    reason
inf. construct      13
open interrog.             1          1

In the חפץ corpus, {ל + inf. construct} realizes an aim SR 13 times, and {ל + open interrogative}
realizes a direct object GF once and a reason SR once.

The Remaining Prepositional Phrases. Four hapax prepositions remain to be examined. In 
general, we resort to simple listing when prepositions are feebly realized in a given verb corpus. In 
the present case, we therefore simply observe that: 


* {אל + deity} has an indirect object grammatical function (GF) in Job 13:3
* {כ + human} realizes a comparison semantic role (SR) in Isa 58:2
* {למען + abstract} has a direct object GF in Isa 42:21
* {עם + [pronoun]} realizes an accompanier SR in Ps 73:25


11.1.4.4 Substantives 

By now we have dealt with each of the entries in table 11.2 except for the “bare” substantives:
noun phrases (37x) and pronouns (3x).

The censoring threshold is 5% of the total number of bare substantives being considered, 40.
Hence, only the category combinations that have counts greater than (.05 × 40) = 2.0 have their
own cells in the simplified table on p. 157. We remark that the bare substantives with חפץ exercise
important grammatical functions (95%).

substantive type    dir obj    subj    [other]
human E 11
inf. construct 2
deity 6
abstract
[pronoun] 3
thing
[other]

This concludes our illustrative censuses of clause composition in the חפץ corpus. In chaps.
12-15, we will proceed along similar lines as we investigate the composition of the CICs involved
in the clause corpora containing each of four frequently occurring verb roots.


11.2 Clause Immediate Constituent Incidence Contours 


We now turn to the visualization of the incidences of CICs in the clauses in a given verb corpus. 

We are in a position to examine all of the clauses in Biblical Hebrew to see which CICs occur 
in verbal clauses. To visualize the relative incidences of CICs for the sets of clauses, we use a bar 
chart with bar heights giving the percentage incidence of the 13 constituents that occur more than 
5% of the time in the full set of 45,503 clauses with Qal active predicators. To understand the bar 
chart, refer to the double arrows labeled a, b, c, and d below the chart (see chart on p. 158). 

Important Note. We subdivide the predicators into anchored (sequential finite verb, infinitive 
construct, n-v participle, and construct participle) and unanchored (non-sequential finite verb and 
predicative infinitive absolute) subsets. We do so because of a peculiarity of the Hebrew sequential 
verb forms: they are “anchored” in the clause-initial position. They thus constrain Hebrew clause
configuration in a way that contrasts with the partially free CIC ordering of other kinds of Hebrew
clauses. Combining the two sorts of clauses has too often led to invalid inferences.
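
The anchored/unanchored subdivision amounts to a fixed lookup from predicator type to subset. The sketch below assumes hypothetical string labels for the six predicator types named in the note (expanding "n-v participle" as noun-verb participle). The function names and the clause identifiers in the usage lines are likewise illustrative only.

```python
# Predicator types as listed in the note; the exact label strings are assumptions.
ANCHORED = {
    "sequential finite verb",
    "infinitive construct",
    "noun-verb participle",
    "construct participle",
}
UNANCHORED = {
    "non-sequential finite verb",
    "predicative infinitive absolute",
}


def predicator_subset(kind):
    """Return "anchored" or "unanchored" for a predicator type label."""
    if kind in ANCHORED:
        return "anchored"
    if kind in UNANCHORED:
        return "unanchored"
    raise ValueError(f"unknown predicator type: {kind!r}")


def split_clauses(clauses):
    """Partition (clause_id, predicator_type) pairs into the two subsets."""
    subsets = {"anchored": [], "unanchored": []}
    for clause_id, kind in clauses:
        subsets[predicator_subset(kind)].append(clause_id)
    return subsets


# Hypothetical clause identifiers, for illustration only.
example = [
    ("clause-1", "sequential finite verb"),
    ("clause-2", "non-sequential finite verb"),
    ("clause-3", "infinitive construct"),
]
print(split_clauses(example))
```

Keeping the two subsets apart from the outset means that no ordering statistic is ever computed over a mixture of the two clause kinds.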

The bar chart (p. 158) summarizes the CIC incidences across all of the Qal active verbal 
clauses making up Biblical Hebrew. It documents, for example, that: 


1. Unanchored predicators are the most common type (appearing in 54% of the clauses). 
2. Anchored predicators are in 46%, subjects in 38%, and direct objects in 48%.
3. Only the 13 CIC classes shown appear in more than 5% of Qal active clauses. 


The incidence bar chart for our sample verb corpus, the חפץ corpus, appears on p. 159. We see
that predicators are almost exclusively finite unanchored verbs. Subjects appear in just over
one-quarter of the clauses and direct objects in nearly two-thirds. Negation is fairly common (23%), and
aim/purpose appears in nearly one-fifth (19%) of the clauses.

In two clauses in the חפץ corpus, a direct object GF and an aim/purpose SR co-occur. Phrase
marker (11.2) from 1 Kgs 10:9 shows one of these clauses. A nearly parallel clause is in 2 Chr 9:8.

CIC Incidence across the Qal Active Corpus

a. The first five bars from the left up to the left-most vertical line below the bars [interval a] correspond to the
available choices for predicators; their percentages add up to 100.

b. The next four bars [interval b] provide the observed percentage incidences for our grammatical functions plus
subject complements lumped with object complements.3

c. The next four bars up to the right-most vertical line [interval c] correspond to the four most common semantic
roles (across all Qal active clauses) ordered from most to least frequent.

d. To the right of the right-most vertical line [interval d], we leave space to display additional CICs that occur in
more than 5% of the clauses for any particular verb corpus being analyzed.


(11.2) [Phrase marker for 1 Kgs 10:9: vb ‘he desired’; dir obj {in + pron} ‘in thee(m)’; ‘to give thee(m)’, with embedded loc {upon + furn} ‘upon’ + n:furn + ‘Israel’.]


3. The wisdom of including subject complements and object complements in with the grammatical functions might 
be debated. Given the present state of our knowledge, we include them because the incidence of these complements is, 
in fact, a function of the verb corpus that is being examined. 

CIC Incidence across the Qal Active חפץ Corpus


11.3 Clause Immediate Constituent Ordering


A significant innovation of generalized phrase structure grammar was the separation of the
dominance aspects of syntax from the precedence aspects.4 As Pollard and Sag put it:5


In every human language, there are language-specific constraints upon the linear order of sister
constituents which apply to all the signs of the language, and hence are to be factored out of the
grammar rules.


The study of the ordering of constituents has been aptly called “positional syntax.”6 At clause level,
positional syntax attempts to account for the “relative position, proximity of position, and absolute
position” of the CICs that make up clauses.

Here, we discuss an approach to positional syntax that some have applied to Biblical Hebrew,
fields-based positional syntax. We then introduce and illustrate our preferred modified approach.8


4. Gerald Gazdar et al., Generalized Phrase Structure Grammar (Oxford: Blackwell, 1985) 44–50. As we noted
in §4.1 and in our glossary, precedence deals with the ordering of constituents. For example, the segments making up a
text occur in precedence order, which matches their physical sequence in the text.

5. Carl Pollard and Ivan Sag, Information-Based Syntax and Semantics, vol. 1: Fundamentals (Stanford, CA: CSLI,
1987) 169.

6. John H. Connolly, Constituent Order in Functional Grammar: Synchronic and Diachronic Perspectives (Berlin: 
Foris, 1991) 3. 

7. Ibid., 4. 

8. In appendix 3, we briefly sketch two alternate ways of investigating positional syntax that do not suit our require- 
ments: (1) linearization: phrase markers mapped onto the “word order domain"; and (2) argument structure: valency 
lists ordered by “obliqueness.” In appendix 3, we also introduce some factors proposed in the linguistics literature to 
account for CIC ordering. 

11.3.1 The Fields Approach to Positional Syntax

“This approach divides a clause into several word order ‘fields’ . . . whose mutual position is
fixed, and studies mainly the word order regularities within these ‘fields.’”9 The method has a long
history, especially among German syntacticians.10 A simplified version relies upon five fields: the
forefield, left bracket, middlefield, right bracket, and postfield. “[A]ny German sentence can be
divided from left to right into [these] fields whose contents will be homogeneous across sentences,
despite variation in word order.”11 How the field/zone contents are determined has been specified
in various ways. The following two approaches represent the extremes of complexity.


11.3.1.1 Specification by Rules 

At one extreme of complexity is Connolly's approach to positional syntax. Based on an interesting
taxonomy of CICs adapted and extended from the taxonomy fashioned by Dutch linguist Simon
Dik, Connolly goes to elaborate lengths, literally, to write a positional syntax. For example,
Connolly's rule (81a) for placing “all adverbials except focusing adjuncts”13 into a massive 17-slot
template takes up just under 5 pages in his book.14 His system is impressive in its seeming attention
to coverage, but it is so complex as to be all but impossible to verify. Further, we doubt whether
inferences based on it could ever attain statistical significance for our smallish corpus.

In the spirit of the functional grammar approach,15 we wonder if some of the complexity in
Connolly's approach results from trying to account for what in reality are pragmatic effects or
discourse effects by enforcing (inappropriate) syntactic constraints. If this is true, then inclusion of the
dimension of discourse should enable a substantial simplification of positional syntax.
11.3.1.2 Specification by Statement 

Seemingly much simpler are summary statements such as the following: 


In a declarative main clause . . . , the subject occupies the forefield, the finite verb is the left bracket, 
and complements and modifiers typically appear in the middlefield. . . . [Alternatively] the forefield 
holds the direct object of the sentence. In general, the forefield may hold no more than one of the verb's 
arguments or modifiers; the question of which one is generally taken to be context determined.16


From our perspective, this sort of statement can be made, if ever, only after exhaustive research has 
been carefully carried out. Before we would venture this sort of summary statement, we would need 
to examine every verb root corpus in Biblical Hebrew. Not having done the research for Biblical 
Hebrew, we are not prepared to adopt the theory relied on by fields-based positional syntax. 


9. Karel Oliva, “The Proper Treatment of Word Order in HPSG,” Actes de COLING-92, 184. 

10. Michael W. Daniels, Generalized ID/LP Grammar: A Formalism for Parsing Linearization-Based HPSG
Grammars (Ph.D. diss., Ohio State University, 2005) 20. Regarding Biblical Hebrew, see Walter Gross, Die Satzteilfolge
im Verbalsatz alttestamentlicher Prosa (Tübingen: Mohr, 1996).

11. Daniels, Generalized, 22. 

12. Connolly, Constituent Order, 71—72. 

13. Ibid., 87: “Focusing adjuncts, such as only, merely, or just.” 

14. Ibid., 73-77. 

15. Ibid., 1: “[A]dopting a functional approach entails regarding pragmatics as the over-arching framework to which 
other aspects of linguistics must be related. According to this view, then, semantics subserves pragmatics, and syntax 
in turn subserves semantics." For us, pragmatics, semantics, and syntax do not form a strict top-to-bottom hierarchy. 

16. Ibid. 

11.3.2 A Descriptive Approach to Positional Syntax

The inconclusiveness of the foregoing leaves us with a question: what approach(es) to visualizing
positional syntax should we use? Our approach to positional syntax is descriptive. For predicator
root/binyan/voice combinations that occur frequently enough to allow meaningful characterization,
we generate horizontal bar charts showing various patterns of CIC sequencing. Our goal
is to show what is in the texts, not to account for it (at least not at this point in our investigations).


11.3.2.1 Nucleus, Core, and Periphery in Role and Reference Grammar 

The basic concepts upon which our description relies have been described by Robert Van Valin,
Jr.17 Van Valin's role and reference grammar benefits greatly from being the result of his addressing
the question: “What would linguistic theory look like if it were based on the analysis of languages
with diverse structures such as Lakhota, Tagalog and Dyirbal, rather than on the analysis of
English?”18 In response, Van Valin introduces19


a very different conception of clause structure . . . from that assumed in other approaches. . . . [T]he 
conception of clause structure that it posits [is] equally applicable to free-word-order, flat-syntax lan- 
guages such as Dyirbal and Malayalam, to head-marking languages like Lakhota and Tzotzil... , and 
to fixed-order configurational, dependent-marking languages like English and Icelandic. 


To achieve this sort of coverage, Van Valin relies on his concept of “the layered structure of the
clause.” Its full explication consumes much of his book. For our purposes, we need to understand
only what he calls “the primary constituent units of the clause.” They are (we quote):20


* the ‘nucleus’, which contains the predicate (usually a verb),21

* the ‘core’, which contains the nucleus and the arguments of the predicate,

* a ‘periphery’, which subsumes non-arguments22 of the predicate, e.g., setting locative and
temporal phrases.


“The universal aspects (the nucleus, core, periphery and clause) are all semantically motivated.”23
Now, here is a crucial point:24 “Since these hierarchical units are defined semantically and not
syntactically, . . . the elements in these units may in principle occur in any order, if a given language
permits it.” We emphasize that Van Valin's terminology relates to semantic definitions of nucleus,
core, and periphery. Contrary to the spatial imagery that inheres in the names of the hierarchical
units (“core,” “nucleus,” “periphery”), they do not imply a particular grouping in surface structure.

In carrying out our investigation of these matters, we will also need to be very diligent in detecting
ordering constraints that have nothing to do with the semantic-to-syntactic mapping but,
rather, result from other constraints.25 For example, for presentational reasons, objects of address




17. R. D. Van Valin Jr., Exploring the Syntax-Semantics Interface (Cambridge: Cambridge University Press, 2005) 
3-8. 

18. Ibid., 1. 

19. Ibid., 3—4. 

20. Ibid., 4. 

21. “Predicate” here equals “predicator” in our terminology. 

22. “Non-arguments” here equals “adjuncts” in our terminology. 

23. Ibid., 8. 

24. Ibid., 5. 

25. Note well: we are describing the ordering of constituents. We are not seeking to account for the ordering, 
although interesting—if premature in our view—efforts have been made to do so. In this connection, see Christo H. J. 

[Bar charts, p. 162: core patterns across all Qal active clauses; left panel “Core Patterns, Anchored,” right panel “Core Patterns, Unanchored.”]

Predicator Type | Rank #1 | Rank #2 | Rank #3 | Rank #4 | Rank #5 | Rank #6
Anchored | VO 32% | V 23% | VS 16% | VSO 7% | VIO 4% | VSIO 3%
Unanchored | V 26% | VO 18% | VS 12% | SV 10% | OV 6% | SVO 5%


(i.e., speeches) overwhelmingly are clause final. This assists cognition, since for all but the short- 
est utterances, the processing of “and Moses said [long speech]” is far simpler than would be the 
processing of “and Moses [long speech] said.” 
11.3.2.2 The Positional Syntax of Grammatical Function CICs 

The Descriptive Categories Adopted. The number of CIC sequences attested in Biblical He- 
brew is vast. The 68,089 verbal clauses exhibit 9,623 patterns. This is because we make use of so 
many CIC categories: three impermanent, three syntactic isolate, six predicator, four operator, four 
GFs, and 41 SRs—61 categories in all. To examine the positional syntax of GFs, we extract clause 
cores by removing from the clausal CIC sequences all non-GF/non-predicator types: 


1. Each underdetermined “and” is part of a clause only because we have not promoted it to 
cue phrase status. The other impermanents are so few that their presence or absence can 
have little effect upon the results of our analysis (see §9.3.1.1). 

2. The syntactic isolates, being isolated, are excluded from the clausal sequences (see §9.3.2).
Altogether, these items account for 1.1% of the CIC population. 

3. We exclude the operators on the grounds that 98% of them either appear immediately 
before the verbs that they operate upon or are clause initial. 


van der Merwe and Eep Talstra, “Biblical Hebrew word order: The interface of information structure and formal
features,” Zeitschrift für Althebraistik 15/16 (2002–3) 68–107.


11.3 Clause Immediate Constituent Composition, Incidence, and Ordering 163 


[Bar charts, p. 163: core patterns for the חפץ corpus; left panel “Core Patterns, Anchored,” right panel “Core Patterns, Unanchored”; horizontal axis 0 to 60 percent.]


4. To reduce the repertoire of CIC sequence patterns, we delete resumed arguments. 
5. For the present investigation of the positional syntax of grammatical-function CICs, we 
delete the SR CICs, leaving only the predicator CICs and GF CICs. 
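
The reduction of a clause to its core pattern can be sketched as follows. The `CIC` record, its category strings, the `resumed` flag, and the GF label strings are all hypothetical names chosen for illustration. The single-letter codes V, S, O, I, and C follow the pattern labels used in the bar charts.

```python
from typing import NamedTuple


class CIC(NamedTuple):
    category: str          # "impermanent", "isolate", "operator", "predicator", "gf", or "sr"
    label: str             # e.g. "sbj", "dir obj", "time point"
    resumed: bool = False  # True for a resumed argument


# Assumed GF label strings mapped to the chart codes.
GF_LETTERS = {"sbj": "S", "dir obj": "O", "ind obj": "I", "cmpl": "C"}


def core_pattern(clause):
    """Return the core sequence of a clause given as CICs in textual order.

    Impermanents, syntactic isolates, operators, SR CICs, and resumed
    arguments are all dropped; only the predicator and its GF CICs remain.
    """
    letters = []
    for cic in clause:
        if cic.category == "predicator":
            letters.append("V")
        elif cic.category == "gf" and not cic.resumed:
            letters.append(GF_LETTERS[cic.label])
    return "".join(letters)


# A made-up clause: and + verb + subject + time point + direct object.
clause = [
    CIC("impermanent", "and"),
    CIC("predicator", "vb"),
    CIC("gf", "sbj"),
    CIC("sr", "time point"),
    CIC("gf", "dir obj"),
]
print(core_pattern(clause))
```

Because all five exclusions reduce to keeping the predicator and the non-resumed GF CICs, a single pass over the clause suffices.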


If a verbal clause has either an anchored or an unanchored predicator, and if it may have no more 
than one subject, direct object, indirect object, and/or complement, then one can show that there 
are 261 possible core sequence (GF sequence) patterns. In fact, we find 189 core sequence patterns 
in Biblical Hebrew. Of these patterns, 63 occur once and 23 occur twice. 
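
The figure of 261 can be checked by brute force. The sketch below assumes only what the paragraph states: exactly one predicator (V) and at most one each of subject (S), direct object (O), indirect object (I), and complement (C), in any order. The function name is illustrative.

```python
from itertools import combinations, permutations

ARGUMENTS = "SOIC"  # subject, direct object, indirect object, complement


def possible_core_patterns():
    """Enumerate every ordering of V with any subset of the four arguments."""
    patterns = set()
    for k in range(len(ARGUMENTS) + 1):
        for chosen in combinations(ARGUMENTS, k):
            for order in permutations(("V",) + chosen):
                patterns.add("".join(order))
    return patterns


patterns = possible_core_patterns()
print(len(patterns))  # 261
```

The same total follows from summing C(4, k) × (k + 1)! over k = 0, ..., 4: 1 + 8 + 36 + 96 + 120 = 261.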


11.3.2.3 Core Patterns across the Qal Active in Biblical Hebrew 

We plot percentages in horizontal bar charts across all Qal active clauses with anchored or un- 
anchored predicators. We use 19 frequently occurring sequence patterns plus “other.” Note that in 
all cases lacking a verb (‘V’) in clause-initial position, the relative frequency is higher for the unan- 
chored verb corpus than it is for the anchored verb corpus (for example, the SVO pattern is far more 
common when the Qal active verb is unanchored than when it is anchored). See chart, p. 162. 

So much for core CIC sequence patterns across the Qal actives of Biblical Hebrew. We are 
much more interested in the comparative behavior for specific verb root corpora. In chaps. 12-15, 
we provide side-by-side bar charts of core CIC ordering patterns, verb corpus by verb corpus. 

11.3.2.4 Core Patterns for the חפץ Corpus

In upcoming chapters, we show bar charts for clauses with anchored or unanchored predicators.
By way of example, the relative frequencies for the ordering patterns in the חפץ corpus are shown
in the chart above, p. 163.

The left-hand bar chart is misleading, because it presents the CIC sequences in the mere three
clauses in the חפץ corpus that contain anchored predicators. Because the sample size for the anchored
corpus is so small, we are well advised to avoid making comparisons between the unanchored
and anchored corpora. We do observe, however, that in 32% of the חפץ corpus, the predicator
is not clause initial (i.e., the sequence is OV, SVO, SV, OVS, or IV).


11.3.2.5 SRs amid GFs

There is one final matter that we investigate in chaps. 12-15: with what frequency do SR CICs
appear physically amid the GF CIC constituents? We proceed as follows:


1. Divide the SR CICs into six groups. Table 11.6 shows our chosen grouping. 


Table 11.6. The Six SR CIC Groups

Accompanier              Movement aim (target)           Manner
Agential                 Movement bearing (direction)    Aim/purpose
Alternate / surrogate    Movement interval               Reason
Beneficiary              Movement origin                 Result
Exocentric absolute      Length                          Undesired outcome
Harmed one               Location                        Comparison
Involved ones            Cost                            Cause
Possessor                Instrument                      Deprivation
Ruled-over one           Material / composition          Quoter
Time aim (goal)          Number of times
Time interval            Reference
Time origin              Resource (supply source)
Time point               Quantity / quantifier

For future reference, this table is reproduced inside the back cover.


2. Across each verb corpus, find all contexts where GF CICs (the predicator and its GF 
arguments) are not contiguous because of “interpolated” SR CICs. 
3. Document these contexts using filled-in versions of table 11.7. 


Table 11.7. SR Interpolations

Percent of SR Groups Interpolated amid <???> GFs

Root | # Qal Active Clauses | Other Participant | Movement and Spatial | Temporal | Manner | Enriching Constituent | Mixed Level and DU

To see our perspective, consider this English clause: “Jean said ‘Hello’ to Mike at 6 A.M.”: sbj – vb
– obj addr – ind obj – time point. In this clause, the GF CICs are contiguous: sbj – vb – obj addr –
ind obj. The time point SR (“at 6 A.M.”) is physically in the clause's periphery. If we had “Jean at 6
A.M. said ‘Hello’ to Mike”: sbj – time point – vb – obj addr – ind obj, then the GF CICs would not
be contiguous, the time point SR being interpolated between the subject and the predicator.
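
The contiguity test on the two English examples can be sketched as follows. The label strings are taken from the examples above. The function name, the set `GF_LABELS`, and the simplifying assumption that every other label in a sequence is an SR CIC are ours, for illustration only.

```python
# Labels treated as GF CICs (the predicator and its arguments); assumed strings.
GF_LABELS = {"sbj", "vb", "obj addr", "dir obj", "ind obj", "cmpl"}


def interpolated_srs(sequence):
    """Return the SR labels that lie between the first and last GF CIC.

    sequence: CIC labels in textual order. An empty result means that
    the GF CICs are contiguous.
    """
    gf_positions = [i for i, label in enumerate(sequence) if label in GF_LABELS]
    if not gf_positions:
        return []
    first, last = gf_positions[0], gf_positions[-1]
    return [label for label in sequence[first:last + 1] if label not in GF_LABELS]


peripheral = ["sbj", "vb", "obj addr", "ind obj", "time point"]
interrupted = ["sbj", "time point", "vb", "obj addr", "ind obj"]
print(interpolated_srs(peripheral))   # []
print(interpolated_srs(interrupted))  # ['time point']
```

An SR CIC before the first GF CIC or after the last one is peripheral and is not counted as an interpolation.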

As it happens, the חפץ corpus has no interrupted GF sequences. Consequently, actual data on
SR interpolations first appear in §12.6.2.


11.4 An Aside on Valency 


11.4.1 The Perspective of Linguistics 

Valency focuses on “the range of syntactic elements either required or specifically permitted by
a verb or other lexical unit. . . . An element which is required is an obligatory valent; one which
is specifically permitted but is not required is an optional valent.”26 “[A] given element may have
different valencies in different contexts. . . . Valency deals not only with the number of valents
with which a verb is combined to produce a well formed sentence nucleus, but also with the
classification of the sets of valents which may be combined with different verbs.”27 In linguistics,
the concept of valency is usually presented as less hard-edged than the concept of the obligatory
complement versus the optional adjunct, but valency's utility is hindered when it relies on vague
notions such as “well formedness.”


11.4.2 The Perspective of Biblical Hebrew Studies 

Waltke and O'Connor define valency (“valence” in their index) as “the number of links a grammatical
element, especially a verb, has [with] other elements.”28 Their main treatment of the matter
in their §10.2.1 demonstrates that they take a flexible-valency perspective.29 Van der Merwe et al.
have a fixed-valency perspective: “The valency of a verb refers to the number and nature of the
obligatory constituents.”30 The study of valency in biblical studies is largely a European pursuit. A
recent work on the subject is in German.31


11.4.3 Valency Cautions 

We appreciate the usefulness of the concept of valency for pedagogy. But as a way of organiz- 
ing and reporting research results, we find it to be chancy because of its operational vagueness, its 
risk of falling into the “translation trap,” and its rather limited applicability. 
11.4.3.1 Operational Vagueness 

Surveying our data for frequently occurring verbs the corpora of which invariably contain 
obligatory constituents, we find none. Specifically, examining the 101 roots that occur more than 
59 times as Qal actives (37,458 clauses), we find neither roots with their clauses always having a 
free-standing subject nor roots with their clauses always having a free-standing direct object. The 


26. P. H. Matthews, Oxford Concise Dictionary of Linguistics (Oxford: Oxford University Press, 2005) 394. 

27. David Crystal, A Dictionary of Linguistics and Phonetics (5th ed.; Oxford: Blackwell, 2003) 487 (italics mine). 

28. B. K. Waltke and M. O'Connor, IBHS, 694.

29. Ibid., 163-69. 

30. C. H. J. van der Merwe, J. A. Naudé, and J. H. Kroeze, A Biblical Hebrew Reference Grammar, 368 (cross- 
referencing emphases deleted). 

31. Michael Malessa, Untersuchungen zur verbalen Valenz im biblischen Hebräisch (Assen: Van Gorcum, 2006).
See also Christo H. J. van der Merwe, “Review of Malessa's Untersuchungen zur verbalen Valenz im biblischen
Hebräisch,” Review of Biblical Literature 4 (2007).
table shows the five roots with the largest incidence of subjects and the five with the largest
incidence of direct objects.


Frequent Subjects Non 76.6% | 752 68.1% |tax 67.6% | 45m 66.7% nn 66.7% 


Frequent Direct Objects | vv 94.2% |qoN 94.2% | 335 93.9% |anx 93.9% |aAnp 93.8% 


Or consider the large corpus organized around Qal actives of the root נתן, a corpus of 1,896
clauses. Van der Merwe et al. cite נתן as an example of a “three-complement” verb consisting of
“[v]erb + subject + object + indirect object . . . Jonathan gave his weapons to the boy (1 Sam.
20.40).”32 Phrase marker (11.3) from 2 Kgs 22:10 provides an OVIS-instance of this prototypical
“three-complement” situation.


(11.3) [Phrase marker for 2 Kgs 22:10: dir obj ‘book’; vb ‘he gave’; ind obj {to + pron} ‘to me’; sbj ‘Hilqiyyah the priest’.]

In fact, only 8% of the נתן clauses contain all three arguments, sbj – dir obj – ind obj. The table
shows how often each argument actually appears in the נתן corpus.


Argument           Count    Percent Incidence
Subject
Direct Object      1,496    79%
Indirect Object    954      50%

Looking across several languages (but not including Biblical Hebrew), Butt and King suggest “that 
argument drop is licensed at the level of discourse structure and that only continuing topics or 
background information may be omitted.” Research topic: Does this hold for Biblical Hebrew? It 
is not sufficient simply to assert that the three arguments are ontologically necessary, and so, when 
any are missing, they are either ellipted or are “understood.” A precept that asserts that certain con- 
stituents are obligatorily present in a clause except when they are not lacks operational usefulness. 


32. Van der Merwe, Naudé, and Kroeze, Biblical Hebrew Reference Grammar, 173. 
33. Miriam Butt and Tracy King, “Null Elements in Discourse Structure,” in Papers from the NULLS Seminar (ed. 
K. V. Subbarao; Delhi: Moti Lal Banarsi Das, 2000). Available from the Web. 

11.4.3.2 The Translation Trap

If נתן in Biblical Hebrew had a semantic range analogous to ‘give’ in English, then the three
arguments stated to be obligatory would be ontologically necessary. But what is true of English
is not necessarily true of Biblical Hebrew. The semantic range of נתן differs from the range of
English ‘give’. For example, in addition to the three-argument (sbj – dir obj – ind obj) sense usually
translated into English by some form of ‘give’, there is also a three-argument (sbj – dir obj – loc)
sense translated by some form of ‘place’. In this circumstance, the preposition in the location CIC
is על ‘upon’.

Phrase marker (11.4) from Exod 25:30 shows an instance where נתן has this sense. Were we to
distinguish this sense, the gloss would read something like ‘thou(m) wilt place’. For this sense of
נתן, it is reasonable to assert that loc, and not ind obj, is an argument. Note well: here is a context
of use where what we have been considering an SR (loc) might best be considered an argument of
its verb, giving it standing as a GF. When the verb corpora of Biblical Hebrew are studied in full
detail, we expect that variants of this phenomenon will appear repeatedly.


(11.4) [Phrase marker for Exod 25:30: ‘and’; vb ‘thou(m) wilt give’; loc {upon + furn} ‘upon the table’; ‘bread of face’; mvt aim {to + body prt} ‘face’ + ‘me’; mnr ‘regularly’.]


11.4.3.3 Limited Applicability 

The smaller the corpus organized around a given root, the less confidence one can have in gen- 
eralizations based on it. To begin with the starkest example: although we can state with certainty 
the valence observed for one-clause corpora involving hapax verb roots, descriptions of this sort are 
certainly not useful generalizations. Of the 968 Qal active verb roots attested in Biblical Hebrew, 
237 occur only once (24%). There is little point in talking about the valency patterns of these verbs. 
To make convincing generalizations, we would need to group the roots into sets reliably. 

How large should a corpus be before we seek to generalize about the quantitative and qualita- 
tive valency of its verb? In chaps. 12-15, we study verb corpora consisting of 2,000—5,000 clauses. 
It should be possible to investigate valency in the 101 verbs occurring 60 times or more, but this is
a mere 4% of the total verb lexeme stock.


11.5 The Structure of Chapters 12-15 


11.5.1 The Verb Corpora Chosen for Analysis 


We count 68,089 verbal clauses in Biblical Hebrew. The census of the 20 most frequent roots 
in predicators is as follows: 


Count Root | Count Root | Count Root | Count 
1.991 yr| 929 719 
1,497 xx? 1,034 noy 873 726 
1,280 now 827 719 
1,128 np 960 av» 815 608 


These 20 roots appear in 29,496 (43.3%) of the 68,089 verbal clauses in Biblical Hebrew. In chaps.
12-15, we examine Qal active corpora for four frequent roots: אמר ‘say’, היה ‘be’, עשה ‘do, make’,
and נתן ‘give’. (A fifth frequent root, בוא ‘come’, is treated with verbs of movement in §17.3.2.)


11.5.2 Reading Options 

In a book such as this, there is a tension between providing too little detail and providing too 
much. While some readers will want access only to the main arguments and supporting data, others 
will want more detail. In an attempt to provide for each sort of reader, we have designed chaps. 
12-15 to be read to two depths: 


* Readers seeking the main points might read only the “Main Observations" previews at the 
beginnings of many sections. When there is no preview, our advice is to scan the section at 
least, studying the phrase markers along the way. 

* Readers who are seeking more detail should read the chapters in toto. 


11.6 Brief Summary 


CIC Ordering. After considering several approaches to the study of positional syntax and 
finding each unsuitable in some way, we opt for a softened version of the approach propounded in 
Van Valin's role and reference grammar, which is a descriptive approach. 

Aside on Valency. Because of its vagueness, its tendency to fall into “the translation trap," and 
its limited applicability, we do not use valency to organize our research or reports thereon. 

The Organization of This Chapter and Chapters 12—15. For each verb corpus in chaps. 12-15, 
we parallel this chapter's progression: 


1. Four Censuses: 
a. Binyanim. The various binyanim attested. 
b. CIC Subtypes. Census of the five CIC subtypes. 
c. Non-GF/SR CICs. Tallies of the non-GF/SR CICs encountered. 
d. GF/SR CICs. How the GF/SR CICs are realized (form-to-function linking) for 
common prepositional phrases and bare substantives. 
2. CIC Incidence Patterns: Four-part vertical-bar tables indicating predicator incidence,
grammatical function incidence, most-frequent SRs, and verb-corpus-specific frequent SRs.
3. CIC Ordering (“positional syntax”):
a. Core Sequences. Pairs of fixed-format horizontal bar charts showing the incidence of
various clausal GF CIC sequences for anchored and unanchored predicators.
b. Core “Intrusions.” Tallies of the rate at which various SR CIC groups (defined inside
the back cover) interpose into sequences of clausal GF CICs.


Chapter 12


The אמר Corpus


12.1 The Binyan Census


In §10.3.3, we included binyan in our list of grammatical features that might assist in inferring
links between syntactic forms and semantic roles. For אמר, however, the census of binyanim is
heavily concentrated in the Qal actives. The verbal stock consists of 5,277 Qal actives, 21 Niphal
passives, 2 Hiphils, and 1 Qal passive. Knowledge of binyan assists only in the assignment of
objects of address (active predicators) and subjects of address (passive predicators).


12.2 The CIC Subtype Census 


The census of the CICs is given in table 12.1. 


Table 12.1. CIC Subtype Census for the אמר Verb Corpus: 5,277 Structures


Impermanents          Count   Predicators                 Count   GFs               Count
                              Finite verb                 4,217   Subject           2,119
                              Predicative inf. abs.           1   Direct object     4,960
Nebulous                  0   Purely verbal participle       51   Indirect object   1,912
Syntactic Isolates    Count   Noun-verb participle           46   Complement            2
Vocative                 29   Infinitive construct          962   Top Six SRs       Count
Exclamative               3   Non-predicative inf. abs.       6   Manner              532
Label                    10   Operators                   Count   Time point          110
                              Negative                       50   Quoter              109
                              Closed interrogative           20   Location             77
                              Other conjunction(a)           12   Aim/purpose          56
                              Modal                           8   Reference            26


a. “Other conjunction” includes all CIC-level conjunctions other than those classified as underspecified “and.”


12.3 Survey of the Non-GF/SR Clause Immediate Constituents 


§12.3 Main Observations: (1) The non-GF/SR clause immediate constituents appear
relatively rarely in the אמר corpus (in under 3% of the clauses). (2) The negatives
immediately precede the predicators (48x) except in rhetorical questions (2x).


12.3.1 Impermanents


12.3.1.1  Underspecified “and” Cue Phrases 

These impermanent CICs are excluded from the clausal analyses (see §9.3.1.1).

12.3.1.2 Indeterminate Constituents

Lapsūs calami appear in Ruth 3:5 and 17, both of which are Qere wĕlōʾ Kethivs. Each might
eventually be restored to its Qere reading: אלי ‘to me’.


12.3.2 Syntactic Isolates 


12.3.2.1 Vocatives 

Noun phrases are formally candidates for being declared vocatives, but alternatively they may
have grammatical functions and semantic roles. Of the 30 vocatives in אמר clauses, three follow
exclamatives (Josh 7:8, Jer 14:13, and Ezek 21:5), almost half are clause initial (15x), and one is
clause terminal (Ezek 11:5). Nearly half, all in Ezekiel, appear in the expression “Son of man, say
. . .” (13x).

12.3.2.2 Exclamatives

Three exclamatives, each preceding some variant of אדני יהוה ‘my Lord Yahweh’, are in Josh
7:8, Jer 14:13, Ezek 21:5.

12.3.2.3 Labels

There are 10 set phrases: נאם־יהוה ‘oracle of Yahweh’ or a variant.¹ Phrase marker (12.1) from
Ezek 21:5 is an אמר clause containing both a vocative and an exclamative.


(12.1) [Phrase marker for Ezek 21:5; tree diagram not recoverable. Surviving glosses, in order: ahah; lords; me; Yahweh; they(m); saying; to; me; not; speaking in riddles; proverbs; he.]


1. The labels may be found in 1 Sam 2:30; Jer 1:15, 3:12, 23:33; Ezek 12:28, 13:8; Hag 1:13, 2:9, 2:14; Zech 1:3. 

12.3.3 Predicators

In addition to 4,217 finite verbs and 962 infinitives construct, we find one predicative infinitive
absolute (Num 6:23), 51 purely verbal participles, and 46 noun-verb participles (three-quarters in
the Prophets). We find six non-predicative infinitive absolute cognate-root “intensifiers.”²


12.3.4 Operators 


12.3.4.1 Negatives 

Of 50 negatives, only 2 do not immediately precede the predicator. In these cases (Gen 20:5 and 
1 Chr 21:17), a rhetorical question is being posed: intrg ... neg... sbj... vb. 
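
The positional test described above can be sketched in code. This is an illustration only, under the assumption that a clause is represented as an ordered list of CIC labels; the labels `intrg`, `neg`, `sbj`, and `vb` come from the text, while the function name and the list representation are hypothetical.

```python
from typing import Sequence


def negative_precedes_predicator(cics: Sequence[str],
                                 neg: str = "neg",
                                 pred: str = "vb") -> bool:
    """True if every negative is immediately followed by the predicator."""
    positions = [i for i, label in enumerate(cics) if label == neg]
    if not positions:
        raise ValueError("clause contains no negative")
    return all(i + 1 < len(cics) and cics[i + 1] == pred for i in positions)


# Ordinary placement: the negative sits directly before the verb.
ordinary = ["neg", "vb", "obj addr"]
# Rhetorical-question pattern cited in the text: intrg ... neg ... sbj ... vb.
rhetorical = ["intrg", "neg", "sbj", "vb"]

print(negative_precedes_predicator(ordinary))    # True
print(negative_precedes_predicator(rhetorical))  # False
```

Run over every clause containing a negative, a check of this kind would separate the 48 ordinary cases from the 2 rhetorical questions reported here.
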

12.3.4.2 Closed Interrogatives

There are 20 closed interrogatives.³ Of these, 18 are clause initial. One follows a vocative (“son
of man” in Ezek 12:9), and the other follows an indirect object (Job 34:31). Phrase marker (12.2)
from 1 Chr 21:17 shows an אמר clause containing both a negative and a closed interrogative.


(12.2) [Phrase marker for 1 Chr 21:17; tree diagram not recoverable. Surviving glosses, in order: yes/no; not; I; I said; to; count; in; the; people.]


12.3.4.3 Other Conjunctions 

There are 11 noncoordinating conjunctions, 8 instances of גם ‘also’⁴ and 3 of אף ‘also’.⁵
Two are clause internal, at Esth 7:2 and Neh 3:35. The others are clause initial and might have been
excluded from analysis had we done discourse analysis and declared them cue phrases.

12.3.4.4 Modals

Modals occur 8 times: כי ‘surely’ (7x) and אך ‘surely’ (1x) (Job 33:8). Phrase marker (12.3)
from Gen 3:1 shows an אמר clause containing both an includer and a modal.


2. They are in Exod 21:5; Judg 15:2; 1 Sam 2:30, 20:21; Jer 23:17; Ezek 28:9. 

3. There are also 35 open interrogatives. Seven are phrasal: למה ‘to-what’ = ‘why?’ (6x) plus עד־מתי ‘until when?’
= ‘how long?’ (2 Sam 2:26).

4. These are in Judg 2:3, 17:2; 1 Kgs 1:48; Ruth 2:21; Esth 7:2; Neh 3:35, 4:16, 6:19. 

5. These are all assigned to the includer CIC class and are found in Gen 3:1, 2 Kgs 5:13, and Job 35:14. 
(12.3) [Phrase marker for Gen 3:1; tree diagram not recoverable. Surviving glosses, in order: surely; he said; God; not; you(m) will eat; from; all of; tree of; the; garden.]


12.4 Survey of the GF/SR Clause Immediate Constituents 


§12.4 Main Observations: (1) One adverb predominates in the אמר corpus: כה ‘thus’
(93% of the adverbs and found in 10% of the clauses). (2) An אמר clause almost
always contains a speech (94%). (3) Three kinds of GF/SR classes are determined once
the semantics of the substantives in their PPs are given: time, infinitive of utterance, or
pronoun. (4) The אמר corpus exhibits a strong alternation in realizing indirect objects:
אל (66%) versus ל (34%). (5) Three prepositions account for 97% of the PP CICs: אל,
ל, and ב. (6) The bare substantives are almost all subjects (96%).


The focus of this subsection is to investigate the linkages between the functions of the CICs (their 
GF/SR labels) and their forms (as provided by the CIC daughter labels). The nature of the CIC 
daughter raw census is given in table 12.2. Each form will be taken up in its turn. 


Table 12.2. GF/SR Daughter Forms


GF/SR Daughter             Count
Adverb                       548
Clause(-like) structure    4,881
Prepositional phrase       2,310
Noun phrase                2,052
Pronoun                       92

12.4.1 Adverbs
The CIC daughter adverbs are subdivided into three subsets based on semantics. We distinguish
adverbs having temporal semantics, adverbs having spatial semantics, and other adverbs (“adverbs
of manner”). The table tallies the adverbs in אמר clauses. Both אז and עתה have semantic role tm
pt (time point), עוד ‘still’ is a tm int (time interval), and עוד ‘again’ is a # times (number of times).


Temporal Adverbs Spatial Adverbs Adverbs of Manner 
(tm pt, tm int, or # times) (loc) (mnr) 


ts then 7 nij here 1 n3 thus 
Tip still [tm int] 10 DU there 1 > very 
4 


Tip again [# times] NI [emphatic] 16 


nnp now 8 okna suddenly 1 


a. These appear mostly in the abundant “Thus said YHWH.”


12.4.2 Clauses and Clause-Like Structures 

The vast majority of אמר clauses (93.5%) contain a speech consisting of a main clause, a sentence,
or discourse unit(s). These structures exercise grammatical functions. The semantic role of
the speeches is address. In our mixed representation, we label a speech obj addr/gram, object of
address licensed by grammar, or sbj addr/gram, subject of address licensed by grammar.

In 14 additional cases, the “speech” is preceded by a cue phrase, כי (8x), ש (1x), or אשר (5x).⁶
Under these circumstances, we label the constructions cogv cmpl/cue (cognitive complement
licensed by a cue phrase). Phrase marker (12.4) from Job 36:10 shows the situation.


(12.4) [Phrase marker for Job 36:10; tree diagram not recoverable. Surviving glosses, in order: and; he said; that; they(m) will return; from; evil.]


Once, as shown in phrase marker (12.5) from Gen 12:13, we are forced⁷ to declare a cogv
cmpl/no cue (cognitive complement licensed without a cue phrase).


(12.5) [Phrase marker for Gen 12:13; tree diagram not recoverable. Surviving glosses, in order: thou(f) say; [emphatic]; me; thou(f).]


6. For כי, see Judg 15:2; Isa 3:10; Ps 118:2, 3, 4; Job 36:10; Qoh 5:5; 1 Chr 21:18. For ש, see Qoh 8:14. For אשר,
see Ezra 2:63; Neh 7:65, 13:19, 13:22; 2 Chr 21:18 (Late Biblical Hebrew?).
7. “Forced” because the pronouns in the clause preclude the embedded speech from being an object of address.

12.4.2.1 Speech-less אמר Clauses
An אמר clause lacks a speech in 343 clauses out of 5,277 (6.5%). Four phenomena account for
the majority of these cases:


1. אמר is the predicator in a nominalized clause,

2. אמר commands aim/purpose,

3. an interrogative is used as a speech surrogate,

4. usage of אמר is akin to the use of דבר.

אמר Is the Predicator in a Nominalized Clause. Sixty-three times (18.4% of the speech-less
אמר clauses) the antecedent is the virtual object of אמר, as in phrase marker (12.6) from Gen 22:2.


(12.6) [Phrase marker for Gen 22:2; tree diagram not recoverable. Surviving glosses, in order: one of; the; mountains; which; shall say; unto; thee(m).]


אמר Commands Aim/Purpose. Forty-eight times (21.4%), we label a prepositional phrase
with ל plus an infinitive construct as an aim CIC.⁸ Phrase marker (12.7) from 1 Kgs 8:12 shows
the behavior.


(12.7) [Phrase marker for 1 Kgs 8:12; tree diagram not recoverable. Surviving glosses, in order: Yahweh; he said; to; dwell; in; the; darkness.]


8. Strictly speaking, this is a misnomer. The usual aim CIC specifies the purpose of the activity portrayed by a
clause's predicator. Here, “dwelling in darkness” is not the purpose of the speech act. The speech act announces this
purpose.

An Interrogative Is Used as a Speech Surrogate. Fourteen times, the open interrogative מה
‘what’ is the surrogate for a speech that is being inquired about, as in phrase marker (12.8) from
1 Sam 10:15.

(12.8) [Phrase marker for 1 Sam 10:15; tree diagram not recoverable. Surviving glosses: you(m); Samuel.]


In phrase marker (12.9) from Lam 3:37, a demonstrative pronoun serves as the speech surrogate. 


(12.9) [Phrase marker for Lam 3:37; tree diagram not recoverable. Surviving glosses, in order: who; this; he said.]


Usage Is Akin to the Use of דבר. After the foregoing explanatory mechanisms have been
exploited, 220 speechless clauses remain unaccounted for. They correspond quite closely to typical
patterns in דבר clauses. Phrase marker (12.10) illustrates both Exod 19:25 and 2 Sam 21:2:⁹


(12.10) [Phrase marker for Exod 19:25 and 2 Sam 21:2; tree diagram not recoverable. Surviving glosses, in order: and; he said; unto; them(m).]


The אמר version leads off 12 clauses. Its fraternal twin with דבר, (12.11), appears once, in Gen 42:24.


(12.11) [Phrase marker for Gen 42:24; tree diagram not recoverable. Surviving glosses, in order: and; he spoke; unto; them(m).]


9. With a concluding explicit subject, it also appears in 2 Chr 31:10.

12.4.2.2 Deep Embedding of Speeches

Speeches embedded 5 levels deep occur 3 times, all in Jeremiah: 22:1–8, 27:2–5, and 36:28–31.
Four-level embedding of speeches occurs more than 20 times. Phrase marker (12.12) for Ezek
21:14 contains 4 levels of speech embedding. The fact that the most deeply embedded speech
continues on beyond the confines of the phrase marker is indicated by the rem/gram node (“remainder
of address”), which hangs off the 4th obj addr/gram node.
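
Counting levels of speech embedding amounts to finding the deepest chain of nested speech nodes in a phrase marker. The sketch below assumes a simple tree of labeled nodes; the `Node` class and the function `speech_depth` are hypothetical, and only the labels `cl`, `obj addr`, and `rem` are taken from the text.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """A phrase-marker node; `label` is a tag such as 'cl' or 'obj addr'."""
    label: str
    children: List["Node"] = field(default_factory=list)


def speech_depth(node: Node, speech_label: str = "obj addr") -> int:
    """Largest number of nested speech nodes on any path below `node`."""
    own = 1 if node.label == speech_label else 0
    deepest = max((speech_depth(c, speech_label) for c in node.children),
                  default=0)
    return own + deepest


# Toy tree shaped like the description of (12.12): four nested speeches,
# with a 'rem' node hanging off the fourth.
level4 = Node("obj addr", [Node("rem")])
level3 = Node("obj addr", [Node("cl", [level4])])
level2 = Node("obj addr", [Node("cl", [level3])])
level1 = Node("obj addr", [Node("cl", [level2])])
tree = Node("cl", [level1])

print(speech_depth(tree))  # 4
```

Subjects of address (`sbj addr`) could be counted in the same way by passing a different `speech_label`.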


(12.12) [Phrase marker for Ezek 21:14; tree diagram not recoverable. Surviving glosses, in order: son of; man; thou(m) prophesy; and; thou(m) wilt say; thus; he said; lords; thou say; sword; she was sharpened; and; also; plucked.]


12.4.3 Prepositional Phrases
The אמר clauses contain 2,310 prepositional phrases. The incidences of 10 prepositions are
tallied here:


12.4.3.1 Temporal Substantive in a PP Is Part of a Time Point SR

Thirty-three times a prepositional phrase involves a noun phrase with temporal semantics.
Independent of the 4 different prepositions involved, in each case the dominating CIC is a tm pt (“time
point”). Trimmed phrase marker (12.13) from Ruth 2:14 shows an instance of this phenomenon
(see p. 178).¹⁰

12.4.3.2 An Infinitive of Utterance Is Always a Quoter SR

There are 109 instances of לאמר ‘to say’ in אמר clauses, each assigned the CIC SR qtr
(“quoter”). Phrase marker (12.14) from Zech 4:13 illustrates the construction (see p. 178).

12.4.3.3 PP with Pronoun Is Almost Always an Indirect Object

A prepositional phrase involves a pronoun 912 times. The grammatical function is ind obj
(“indirect object”) in all but nine cases (99%). Five of the exceptions involve על + [pronoun]. As we
will see, על links fairly strongly to a refrnt (“referential”) role. Phrase marker (12.14) includes an
indirect object involving a preposition plus a pronoun.


10. To save space, we have omitted the speech in (12.13) at the end of the clause. 
(12.13) [Phrase marker for Ruth 2:14; tree diagram not recoverable. Surviving glosses, in order: and; he said; to; her; Boaz; to; time of; the; food.]

(12.14) [Phrase marker for Zech 4:13; tree diagram not recoverable. Surviving glosses, in order: and; she said; unto; me; say; yes/no; not; thou(m) didst know; what; these.]


12.4.3.4 The 5x Prepositional Phrases 

The preposition אל is very strongly linked to the indirect object grammatical function. It occurs
1,281 times, dominated all but 5 times by an indirect object function node (99.7%). Table 12.3
reports the two main substantive types involved.


Table 12.3. אל-Phrase Substantive versus GFs/SRs

[Table body not recoverable. Surviving substantive-type labels: human; [pronoun]; [other].]

a. Thirteen of these are distributive.

Phrase marker (12.15) from Judg 9:14 has an indirect object involving אל and a thornbush:


(12.15) [Phrase marker for Judg 9:14; tree diagram not recoverable. Surviving glosses, in order: and; they(m) said; all of; the; trees; unto; the; thornbush; thou(m) walk; thou(m); thou(m) reign; upon; us.]


12.4.3.5 The ל Prepositional Phrases

The preposition ל is also strongly linked to the indirect object grammatical function. It occurs
835 times, but 109 occur with infinitives of utterance (inf utt) as just discussed in §12.4.3.2,
leaving 726 instances for tabulation. These are indirect objects about 90% of the time, as table 12.4
documents.


Table 12.4. ל-Phrase Substantive versus GFs/SRs

[Table body not recoverable. Surviving substantive-type labels: [pronoun]; other inf. construct; [other]. Surviving counts in the [other] row: 60, 1, 18.]

a. The indirect object in Isa 49:7 consists of an infinitive construct, a noun-verb
participle, and a noun phrase.


For variety, phrase marker (12.16) from Ps 139:20 shows an atypical אמר clause: (1) The
indirect object is a suffixed pronoun. (2) The clause contains no object of address (no speech). (3) It
contains a semantic role of aim involving a PP with a noun having mental semantics.


(12.16) [Phrase marker for Ps 139:20; tree diagram not recoverable. Surviving glosses, in order: they(m) will say; thee(m); to; plan.]


Comparing tables 12.3 and 12.4, we observe that the אמר corpus exhibits a much-discussed strong
alternation in the realization of indirect objects. Indirect objects involve אל (66.4%) twice as often
as ל (33.6%). The nature of this alternation is briefly taken up in appendix 4, where we provide
counterexample data suggesting that neither Jenni's “social distance” explanation nor Malessa's
“separation from the predicator” explanation adequately accounts for the alternation phenomenon.

12.4.3.6 The ב Prepositional Phrases

Prepositional phrases involving ב occur 115 times in אמר clauses, never as indirect objects.
They are linked to location SRs 63 times (52.9%) and to time point SRs 46 times (38.7%). These 2
SRs account for more than 90% of the instances of ב prepositional phrases in אמר clauses.


Table 12.5. ב-Phrase Substantive versus GFs/SRs

[Table body not recoverable. Surviving column labels include tm pt and [other]. Surviving substantive-type labels: human; other inf. construct; body part; geographic; temporal; [other].]

a. These cases involve phrases such as “in the nations,” “in the people,” and
“in Israel,” this last because we have not (yet) resolved the homography of personal
names, geographical names, and so on.


The 34 location CICs involving body parts attract interest. Thirty consist of prepositional phrases
of the “in my heart” variety, indicative of soliloquizing. Four involve speaking “in the ear of . . .”
(Judg 17:2, Isa 49:20, Ezek 9:5, Job 33:8).

Phrase marker (12.17) from Judg 17:2 shows a clause with an SR loc involving a body part and
without an object of address.


(12.17) [Phrase marker for Judg 17:2; tree diagram not recoverable. Surviving glosses, in order: also; thou(f) didst say; ears; me.]


12.4.3.7 The Remaining Prepositional Phrases

The prepositions אל, ל, and ב appear in 96% of the prepositional phrases, leaving 81 unlabeled
prepositional phrases. These involve 6 different prepositions. We give their tallies using the
condensed format:


1. כ: There are 31 time point (tm pt) semantic roles and 6 comparison semantic roles.

2. על: There are 21 refrnt (referential), 2 ind obj (indirect object),¹¹ and 1 loc (location)¹²
semantic role.

3. אחרי: There are 7 tm pt SRs and 1 aim SR. Three involve infinitives construct.

4. את: There are 5 marked direct objects. The direct objects refer to utterances: “all of words
of Yahweh” (1 Sam 8:10) and “this word” (Jer 13:12, 14:17, 23:38, 31:23).

5. מן: There are 4 tm pt SRs (with temporal noun semantics: Gen 19:34, Exod 32:30, 2 Sam
15:7, and Jer 13:6) and 1 ellipted cause (mental noun semantics: Deut 28:67 2x).

6. עד: There is 1 tm pt (2 Chr 35:25) and 1 open interrogative tm int (2 Sam 2:26).

7. למען: There is 1 extensive aim constituent at Gen 37:22.


12.4.4 Substantives
The final constituents to be discussed are the “bare” substantives.¹³ In the total 5,277 Qal active
אמר clauses, there are 2,144 bare substantives. Table 12.6 provides tallies for the major categories.


Table 12.6. Substantive versus GFs/SRs

[Table body not recoverable.]


It is natural to wonder if there are principled methods for determining which function a given
substantive should have. The default rule obviously is that a bare substantive in an אמר clause is
this clause's subject. This sort of blanket choice is correct 2,114 times in 2,144 rule applications
(98.6%). But what about the tail of 30 substantives (1.4%) that do not function as subjects?
Possible methods of identification come to mind, none of them foolproof. Consider using number
agreement as an aid. This is critical for identifying distributive subjects but fails where
garden-variety subjects are concerned. Two examples should make our case. In phrase marker (12.18)
from Ps 118:3, number agreement is violated since a plural verb combines with a subject currently
labeled singular.¹⁴


11. A standard indirect object appears in 2 Kgs 22:8; a distributive indirect object is in Jer 23:35.
12. In Ps 4:5.
13. That is, substantives that are not part of prepositional phrases.


(12.18) [Phrase marker for Ps 118:3; tree diagram not recoverable. Surviving glosses, in order: let them(m) say; [emphatic]; that; to; house of; Aaron; eternity; loyalty; his/him.]


In phrase marker (12.19) from 2 Sam 17:14, number agreement is violated since a singular verb
combines with a plural subject.


(12.19) [Phrase marker for 2 Sam 17:14; tree diagram not recoverable. Surviving glosses, in order: and; he said; Abshalom; and; all of; men of; Israel; good; counsel of; Hushay; the; Arkite; from; counsel of; Ahitopel.]


14. See also Exod 32:12; Deut 9:28; 1 Kgs 20:28; Ezek 12:9, and 18:29.
[Bar chart, unanchored predicators: VSO 36%, VO 17%, VIO 9%, OVS 7%, SVO 5%, VS 4%]


12.5 Clause Immediate Constituent Incidence across the אמר Corpus


We next take up the incidence of CICs across the אמר corpus. Consider the bar chart at the top
of p. 183 for the clauses making up the אמר corpus. Compared with the incidences of CICs across
Biblical Hebrew shown in §11.2 (p. 162), אמר clauses use unanchored verbs half as often and
anchored verbs twice as often. Direct objects (actually, objects of address) are twice as frequent,
while indirect object and manner CICs are 4 times more frequent. Other than the formulaic manner
CIC (כה ‘thus’), few semantic role CICs are present.


12.6 Core Constituent Ordering 


12.6.1 Core Patterns for the אמר Corpus

We plot bar charts for אמר clauses with anchored or unanchored predicators (see bottom, p. 183).
The procedures for extracting the core patterns were specified in §11.3.2.2. Among the anchored
predicators, the Other category is sufficiently large to attract scrutiny. On investigation, we find that
it contains just over 20% of the corpus, its 2 largest contributors being VSIO, 15.7%, and VISO,
4.7%, both being in the top ranks.

Other for the unanchored predicators accounts for around 12% of its clauses, its 3 largest
contributors being VSIO, 3.5%, SVIO, 2.7%, and IVO, 2.2%.


12.6.2 SRs amid GFs

The numerical entries in the table below disclose what percent of a given SR-group is
interpolated into the GFs of the clauses making up the אמר corpus. For example, the table informs us
that 4.1% of the manner SR tokens found in the אמר corpus are interpolated amid the GFs of their
clauses.
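
One way to make “interpolated amid the GFs” operational is sketched below. It rests on an assumption not stated in the text: an SR CIC counts as interpolated when it lies strictly between the first and the last GF CIC of its clause. The label set `GF_LABELS` and the function name are hypothetical.

```python
from typing import List, Set

# Assumed set of core GF labels; an illustrative choice only.
GF_LABELS: Set[str] = {"vb", "sbj", "dir obj", "ind obj", "obj addr"}


def interpolated_srs(cics: List[str],
                     gf_labels: Set[str] = GF_LABELS) -> List[str]:
    """Return the non-GF labels lying strictly between the first and last GF."""
    gf_positions = [i for i, label in enumerate(cics) if label in gf_labels]
    if len(gf_positions) < 2:
        return []
    first, last = gf_positions[0], gf_positions[-1]
    return [label for label in cics[first + 1:last] if label not in gf_labels]


# 'mnr' precedes the whole core and is not interpolated; 'tm pt' sits
# between the subject and the object of address and is.
clause = ["mnr", "vb", "sbj", "tm pt", "obj addr"]
print(interpolated_srs(clause))  # ['tm pt']
```

In this toy clause the interpolated constituent stands directly before the object of address, which is the pre-speech position that the adjustments in this section set apart from genuine interpolation.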


Percent of SR-Groups Interpolated amid אמר GFs


Root | # Qal Active Clauses | Other Participant | Movement and Spatial | Temporal | Manner | Enriching Constituent | Mixed Level and DU

In obtaining these figures, adjustments have been made: 


* Movement and Spatial. Of the 78 movement/spatial tokens, 53 are interpolated, 67.9%. 
But 50 of these occur immediately before an all-but-anchored object of address, leaving 3 
genuine interpolates, an incidence of 3/78 or 3.8%. 

* Temporal. Of the 121 temporal SR tokens, 29 are interpolated, 24.0%. But 23 of these
occur immediately before an all-but-anchored object of address, leaving 6 genuine
interpolates, an incidence of 6/121 or 5.0%.

* Manner. Of the 532 instances of the manner SR, 24 are interpolated, 4.5%. But 2 of 
these occur immediately before an all-but-anchored object of address, leaving 22 genuine 
interpolates, an incidence of 22/532 or 4.1%. 

* Enriching Constituent. Of the 64 enriching constituent tokens, 36 are interpolated, 56.2%. 
But 28 of these occur immediately before an all-but-anchored object of address, leaving 8 
genuine interpolates, an incidence of 8/64 or 12.5%. 
* Mixed Level and Discourse Unit. Of the 176 mixed-level/DU tokens, 2 are interpolated,
1.1%. But both of these occur immediately before an all-but-anchored object of address,
leaving no genuine interpolates.


We count 144 interpolates if the pre-speech CICs are included, 39 interpolates if the pre-speech
CICs are excluded. Hence, adjuncts are not interpolated among the GFs in the אמר corpus very
often. The interpolation rate is 39/5,277 = 7.4 interpolations per 1,000 אמר clauses.
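
The adjustments listed above and the two totals can be recomputed from the token counts. The sketch below uses only the counts quoted in the bullets; the dictionary layout and the helper `rates` are hypothetical conveniences.

```python
# Per SR group: (total tokens, interpolated tokens,
#                interpolated tokens standing directly before the object of address)
SR_GROUPS = {
    "movement/spatial": (78, 53, 50),
    "temporal": (121, 29, 23),
    "manner": (532, 24, 2),
    "enriching constituent": (64, 36, 28),
    "mixed level/DU": (176, 2, 2),
}
CLAUSES = 5_277


def rates(total: int, interpolated: int, pre_speech: int):
    """Return (raw %, genuine interpolate count, adjusted %)."""
    genuine = interpolated - pre_speech
    return 100 * interpolated / total, genuine, 100 * genuine / total


all_interpolates = sum(i for _, i, _ in SR_GROUPS.values())
genuine_interpolates = sum(i - p for _, i, p in SR_GROUPS.values())
per_thousand = 1000 * genuine_interpolates / CLAUSES

for name, counts in SR_GROUPS.items():
    raw, genuine, adjusted = rates(*counts)
    print(f"{name:22s} raw {raw:5.1f}%  genuine {genuine:3d}  adjusted {adjusted:4.1f}%")
print(all_interpolates, genuine_interpolates, round(per_thousand, 1))
```

The five groups listed in the bullets sum to the totals reported in the text, 144 with the pre-speech CICs and 39 without them.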


12.7 Brief Summary 


The following overview suggests that, syntactically, the אמר corpus exhibits what one might
term “bureaucratic single vision.”

The Censuses. The predicators of אמר are overwhelmingly Qal actives (99.6%). Most predicators
are finite verbs (80%), with “infinitive constructs of utterance” the next most common predicator
(18%).

Non-GF/SR CICs are rare. Omitting impermanent “ands,” they appear in 3% of clauses. Negatives
are the most common, appearing in 1% of the clauses.

With regard to GFs, explicit subjects appear in 40% of the clauses, speeches appear as objects
of address in 94%, and indirect objects in 36%. Indirect objects involve אל twice as often as they
involve ל. There is no completely convincing explanation for this alternation.

The adverb of manner כה ‘thus’ occurs in just under 10% of the clauses. Three prepositions
account for 96% of the PP CICs: ל, אל, and ב. The bare substantives are almost all subjects (97%).
Bound pronouns are almost always indirect objects (97.2% of the time). Free pronouns are
surprisingly rare, appearing in under 2% of the clauses, and all are subjects.

CIC Incidence. אמר clauses use unanchored verbs half as often and both anchored verbs and
infinitive constructs twice as often as is the case across Biblical Hebrew. Direct objects (actually,
objects of address) are twice as frequent, while indirect objects and manner CICs are 4 times more
frequent. The great majority of CICs exhibit grammatical functions. Other than a formulaic manner
CIC (כה ‘thus’), few semantic role CICs are present.

Core Patterns. The אמר corpus has a preference for clause-initial predicators (82.1%). Of
these, 91% involve anchored predicators and could not be other than clause initial, but the other 9%
are unanchored. For the anchored cases, the dominant sequences are VO, followed by VIO and then
VSIO. For the unanchored cases, the dominant sequences are VSO, followed by VO and then VIO.
That these dominant sequences are object-of-address final is a reflex of the preference for having all
but the shortest speech clause final. Intrusions of SRs into the אמר clause cores are rare, occurring
at a rate of 7.4 intrusions per 1,000 clauses.

The distribution of היה binyanim is concentrated in the Qal: 3,546 are Qal and 19 are Niphal.
Because the Niphals are so few, they are omitted from consideration here.
The root היה appears in these three very different contexts:


1. single predicators within clauses (2,798x),
2. signalers of transitions in discourse (620x),
3. auxiliary verbs in periphrastic constructions (128x).¹


We will discuss each of these distinct uses of היה verbs, but our prime focus will be on clauses
where היה is the sole predicator. Signalers in discourse will be illustrated in §13.3, but their full
exposition must await our planned volume on discourse. We introduce periphrastics in §13.4.


13.2 Single Predicators in Clauses 


13.2.1 The CIC Subtype Census 
The census of the CIC subtypes is as shown in table 13.1. 


Table 13.1. CIC Subtype Census for the היה Corpus: 2,798 Structures


Impermanents          Count   Predicators                 Count   GFs               Count
Underspecified “and”          Finite verb                         Subject           1,818
Lapsus calami                 Predicative inf. abs.               Direct object
Nebulous                      Purely verbal participle            Indirect object     171
Syntactic Isolates    Count   Noun-verb participle                Complement        1,199
Vocative                      Infinitive construct                Top Six SRs       Count
Exclamative                   Non-predicative inf. abs.           Location
Label                         Operators                   Count   Possessor           570
                              Negative                            Comparison
                              Closed interrogative                Time point          256
                              Other conjunction(a)                Beneficiary         148
                              Modal                           8   Accompanier         134


a. “Other conjunction” includes all CIC-level conjunctions other than those classified as underspecified “and.”


1. In one of the periphrastic constructions, both verbs have the root היה: Ps 50:21.

13.2.2 Survey of the Non-GF/SR Clause Immediate Constituents


§13.2.2 Main Observations: (1) There are few non-GF/SR CICs in the היה corpus;
12.2% of the clauses contain them, and if the negatives are excluded, 3.3%. (2) As for
the negative CICs, almost always they immediately precede the predicators (95%).


13.2.2.1 Impermanents 

Underspecified “and” CICs. Excluded from the clausal analyses.

Indeterminate Constituents. There are two lapsūs calami, in Jer 50:29 and Lam 5:3. Both are
Qere wĕlōʾ Kethivs with the Qere readings לה ‘to her’ and ו ‘and’.

Nebulous. Two object markers, in 1 Kgs 11:15 and 25, and a noun phrase in Ezek 45:5.

13.2.2.2 Syntactic Isolates

Vocatives. There are 25 vocatives. One-quarter are clause initial (6x), 2 follow exclamatives
(Neh 1:5 and 1:11), and one-third are clause terminal (9x). Almost half are in the Latter Prophets
(11x) and one-third (9x) in the Writings.

Exclamatives. There are 9 exclamatives. Three are clause-initial הנה or הן in Gen 27:39, 30:34;
and Josh 24:27; אנא occurs in Neh 1:5 and 1:11.

Labels. There are 19 set phrases: נאם־יהוה ‘oracle of Yahweh’ or a variant. They are clause
terminal 16 times and clause medial 3 times.

13.2.2.3 Predicators

Of the 11 non-predicative infinitive absolutes, there are 6 cognate-root intensifiers² and 5
non-cognate root amplifiers.³

13.2.2.4 Operators

Negatives. Of 258 negatives, 14 do not immediately precede the predicator.

Closed Interrogatives. There are 12 closed interrogatives.

Other Conjunctions. Fourteen are instances of גם ‘also’, 3 of כי ‘that’ (Gen 21:30, Judg
16:25, and Jer 48:27), and 1 of או ‘or’, in Job 3:16.

Modals. Modals occur 8 times: כי ‘surely’ 4x, אך ‘surely’ 2x (1 Sam 18:17 and Hos 12:12),
לו ‘would that’ 1x (Gen 30:34), and מי־יתן, an optative-modal idiom often translated ‘would that,’
1x (Deut 5:29). See §15.3.4.4.


13.2.3 Survey of the GF/SR Clause Immediate Constituents


§13.2.3 Main Observations: (1) Adverbs are few, and none predominates. (2) The
identities of two kinds of PP CICs are determined once the semantics of their substantives
is given: time and infinitive of utterance. (3) The ל-phrases are roughly equally
divided among the possessor, subject complement, and “other” SRs. (4) Excluding time
noun phrases, most ב-phrases involve locations (nearly 80%). (5) Nearly two-thirds of
the bare substantives are subjects (64%), about three-tenths are subject complements
(30%), and the rest are scattered over many SRs (6%).


2. Gen 18:18; Num 30:7; 1 Kgs 13:32; Jer 15:18; Ezek 1:3, 20:32. Each immediately precedes its predicator, except 
in Ezek 20:32, which has an interpolated negative. 
3. 2 Sam 12:2; Qoh 2:7, 16; 11:8; and 2 Chr 32:27. All involve הרבה ‘to increase’.

This subsection investigates linkages between CIC functions (given by CIC GF/SR labels) and
their forms (given by the labels on the CIC daughter nodes). The CIC daughters distribute as shown 
in table 13.2. Each type of form will be taken up in its turn. 


Table 13.2. GF/SR Daughter Forms 


GF/SR Daughter Count 
Adverb 164 
Clause(-like) structure 3 
Prepositional phrase 3,281 
Noun phrase 2,524 
Pronoun 191 


13.2.3.1 Adverbs 
The adverbs subdivide into three subsets. We distinguish adverbs with temporal semantics, ad- 
verbs with spatial semantics, and certain other adverbs (“adverbs of manner”): 


Temporal Adverbs (tm pt, tm int, # times)
אז then
כבר already
yet
עוד still [tm int]
עוד again [# times]
עתה now
טרם not yet

Spatial Adverbs (loc)
שם there 43
שמה thither 3

Adverbs of Manner (mnr)
together 2
כה thus 3 (a)
מאד very 4 (b)
נא [emphatic] 15
רק only [rstr] 3


a. The count includes 2 713 in Gen 15:5 and Isa 24:13 and 1 M22 in Neh 5:13. 
b. The count consists of 3 TRN and 1 7D (Exod 3:12). 


13.2.3.2 Clause(-Like) Structures 
There are three clause-like structures, each labeled a cognitive complement (cogv cmpl): Gen 
21:30, Judg 16:25, and Jer 48:27. 


13.2.3.3 Prepositional Phrases 

The היה clauses contain 3,281 prepositional phrases that are CIC immediate daughters. Each
consists of two sisters: a preposition, and a substantive or substantive-like constituent. The
CIC-dominated prepositional phrase in (13.1) from Judg 11:8 illustrates the concept (see p. 189). The
CIC is a beneficiary (benf) dominating the prepositional phrase (to + humn/prep), which dominates
the preposition ל ‘to’ and the construct chain כל ישבי גלעד ‘all of dwellers of Gilead’.
(13.1) [Phrase marker, Judg 11:8: a benf CIC (to + humn) dominating the preposition ‘to’ and the construct chain ‘all of dwellers of Gilead’.]


These are the prepositions that are involved:


Preposition | Count
ל | 1,584
ב | 636
כ | 278
מן | 180
על | 171
אל | 168
עם | 133
עד | 57
Misc. | 74


Temporal Substantive in PP Is Part of Time SR. Prepositional phrases involving nouns with
temporal semantics occur 105 times. In all but 12 cases,⁴ their dominating CICs involve time.

Infinitive of Utterance Is Always Quoter SR. The identity of the preposition is usually needed
to infer the GF or SR of a prepositional phrase. If, however, an infinitive of utterance or a
substantive having temporal semantics is involved, then, as was the case for the אמר corpus, the nature
of the phrase of which it is part is thereby determined. There are 115 instances of לאמר ‘to
say’ in היה clauses, and each is assigned the CIC label qtr (quoter). Phrase marker (13.2) from Jer
1:11 illustrates the situation:


(13.2) [Phrase marker, Jer 1:11: ‘and he was word of Yahweh unto me to say ... thou(m) seeing Jeremiah’; the infinitive of utterance ‘to say’ belongs to the qtr CIC.]


The ל Prepositional Phrases. The CIC-dominated prepositional phrases with ל occur 1,584
times. Table 13.3 documents how the ל prepositional phrases are distributed across major GFs/SRs.


4. In 10 of these cases, some form of the preposition כ ‘like’ is involved.

Table 13.3. ל-Phrase Substantive versus GFs/SRs


substantive type | poss | sbj cmp | [other]
[pronoun] | 385 | | 98
human | 135 | 122 | 88
[other] | 43 | 307 | 409
Total | 563 | 426 | 595


The ב Prepositional Phrases. As table 13.4 documents, prepositional phrases involving ב
occur 636 times in the היה corpus. They are linked to location SRs 404 times (64%) and to time points
167 times (26%). These two SRs account for 90% of the ב prepositional phrases in היה clauses.


Table 13.4. ב-Phrase Substantive versus GFs/SRs


substantive type | loc | tm pt | [other]
[pronoun] | 80 | | 10
human | 53 | 2 | 8
other infinitive construct | | 43 | 1
geographic | 145 | | 1
temporal | | 110 | 10
body part | 43 | | 5
[other] | 83 | 12 | 30
Total | 404 | 167 | 65
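
The percentages quoted for table 13.4 follow directly from its Total row. The sketch below is only an illustration of that arithmetic: the dictionary keys and the helper name `shares` are labels chosen here, not part of the authors' database.

```python
# Column totals copied from the Total row of table 13.4.
# The key names are illustrative labels, not database field names.
counts = {"loc": 404, "tm pt": 167, "other": 65}


def shares(column_totals):
    """Return each column's share of the grand total as a whole-number percent."""
    grand_total = sum(column_totals.values())
    return {label: round(100 * n / grand_total) for label, n in column_totals.items()}


total = sum(counts.values())
pct = shares(counts)

print(total)                      # number of phrases tallied in the table
print(pct)                        # per-column percentages
print(pct["loc"] + pct["tm pt"])  # combined share of the two leading SRs
```

Rounding each share separately and then adding them reproduces the combined figure given in the prose.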


Remaining Prepositional Phrases. The ל and ב PPs together account for 67.3% of the PPs.
Remaining for GF/SR specification are 1,076 PPs, involving 6 “major” prepositions:


1. כ: The next most frequent PP involves כ ‘like’. We have classified these phrases as
comparisons (cmpr) 260 times out of 278 occurrences (93.5%). Twelve PPs that are not
comparisons have temporal SRs. Example: כיום הזה ‘as is now the case’, a time point
[tm pt] in Deut 4:20. In our mixed representation, 5 have the GF of subject complement
and 1 of subject. Example: והיה כצדיק כרשע ‘innocent and guilty fare alike’ in Gen 18:25.

2. מן: There are 180 PPs involving מן. More than half (97, 54%) are movement origins (mvt
orig). The rest scatter across 16 CIC labels.

3. על: Of the 171 phrases involving על, the substantial majority (134, 78.4%) are location
(loc) CICs. The remaining 35 appear in 8 other CIC classes.


5. Due to our mixed representation (§9.2), a CIC may correctly be assigned a GF in 1 context but improperly
receive an SR in another. Here, the 2 PPs have their GFs specified (sbj and sbj cmp), but we could have called each a 
comparison (cmpr), albeit violating the constraint that assigning GF takes precedence over SR representation. 
4. אל: There are 168 PPs with אל. Just over three-quarters are indirect objects (ind obj):
pronominal 69 times, human 59 times, and divine once. The remaining 39 phrases
comprise 7 CIC classes, with 29 involving location or orientation.

5. עם: There are 132 accompanier (accmp) SRs involving עם. Twice, the preposition might
be translated ‘beside’ (so BDB). In 2 Sam 24:16, the angel was “עם the threshing floor,”
while in Deut 18:1 the priests have no territory “עם Israel.”

6. עד: If the nonprepositional constituent is an infinitive construct (13x) or has temporal
semantics (30x), then the SR is time aim (tm aim) or, 1x, time interval (tm int). The SR is
movement aim (mvt aim) 13x.⁷


13.2.4 Substantives 


The final constituents to be considered are the 2,715 “bare” substantives.⁸ Of the total 2,798
single-predicator היה clauses, 2,078 clauses (74%) contain one or more bare substantives. They
distribute as shown in table 13.5.


Table 13.5. Substantives for the היה Corpus


substantive type | subj | subj cmp | [other]
human | 373 | 241 | 15
vocable | 193 | 15 |
[other] | 1,240 | 492 | 146
Total | 1,806 | 748 | 161
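
The Total row of table 13.5 and the 74% figure can be rechecked with a few lines of arithmetic. This is a minimal sketch under two assumptions that are not stated in the text: the empty cell in the vocable row is treated as 0, and the vocable count of 15 is read as belonging to the subject-complement column (the only reading under which the column totals add up). The names `TABLE_13_5` and `column_totals` are illustrative.

```python
# Cell values copied from table 13.5; the blank vocable cell is assumed to be 0.
TABLE_13_5 = {
    "human":   {"subj": 373,  "subj cmp": 241, "other": 15},
    "vocable": {"subj": 193,  "subj cmp": 15,  "other": 0},
    "[other]": {"subj": 1240, "subj cmp": 492, "other": 146},
}


def column_totals(table):
    """Sum every column across all substantive types."""
    totals = {}
    for row in table.values():
        for column, n in row.items():
            totals[column] = totals.get(column, 0) + n
    return totals


totals = column_totals(TABLE_13_5)
grand_total = sum(totals.values())

# Share of single-predicator clauses that contain at least one bare substantive.
clauses_with_bare = 2078
all_clauses = 2798
clause_share = round(100 * clauses_with_bare / all_clauses)

print(totals, grand_total, clause_share)
```

Checking totals this way is a cheap guard against transcription slips when a table is re-keyed.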


13.3 היה Verbs as Discourse Transition Markers


The discourse transition sequences ויהי ‘and he was’ (376x) and והיה ‘and he will be’ (167x)
are dominated by cue/cue nodes⁹ in our representation, as in phrase marker (13.3) from Ruth 3:8.
The two segments form a compound cue phrase.¹⁰ We find this phenomenon 546 times (see PM
(13.3), p. 192).¹¹ We defer discussion of the way these cue phrases function until we address
discourse analysis in chap. 21. For now, we observe that in three-quarters of the contexts with these
cue phrases, the first CIC in the following clause has a semantic role of time point (tm pt/gram).
One of the tasks for discourse analysis will be to decide on the proper scoping of these
time-specifying CICs. Over how much of a following narrative do they “hold sway”?


6. On this phenomenon, see F. I. Andersen and D. N. Freedman, Hosea (AB 24; Garden City: Doubleday, 1980)
149-51.
7. There are five exceptions: interrog.: Exod 10:7, Neh 2:6; manner: 2 Sam 2:17, 1 Kgs 18:45; result: 1 Kgs 17:17.
8. That is, substantives that are neither part of prepositional phrases nor nominalized entities.
9. In addition to the standard pair of markers, there are three oddities: 2 Sam 5:24, 1 Kgs 14:5, and 1 Chr 14:15.
10. Van der Merwe, Naudé, and Kroeze, A Biblical Hebrew Reference Grammar, 331. Also Waltke and O’Connor,
IBHS, 54 n. 24, 634; and M. O’Connor, “Discourse Linguistics and the Study of Biblical Hebrew,” in Congress Volume:
Basel, 2001 (ed. A. Lemaire; VTSup 92; Leiden: Brill, 2002) 18.
11. See C. van der Merwe, “The Elusive Biblical Hebrew Term ויהי: A Perspective in Terms of Its Syntax,
Semantics, and Pragmatics in 1 Samuel,” HS 40 (1999) 83-114. Also K. Jongeling, “‘And It Came to Pass’ Again,” in
Memoriae Igor M. Diakonoff (ed. L. Kogan et al.; Babel und Bibel 2; Winona Lake, IN: Eisenbrauns, 2005) 291-329.


(13.3) [Phrase marker, Ruth 3:8: the cue phrase ‘and he was’, followed by a clause that opens with the tm pt CIC ‘in half of the night’ and continues ‘and he shuddered the man’.]

13.4 Periphrastics 


Consider phrase marker (13.4) from Nah 3:11, where the predicator results from compounding
תהי נעלמה ‘do thou(m) be ... hidden’, which is licensed by the relation inverse modification (inv
mod).


(13.4) [Phrase marker, Nah 3:11: a compound predicator (vb ptc, inv mod) glossed ‘do thou(f) be ... hidden’, together with ‘also’ and the subject pronoun ‘thou(f)’.]

13.5 Clause Immediate Constituent Incidence Contours 


Examining the bar chart for the clauses making up the היה corpus, we see that, in comparison
with the overall incidences shown in §11.2, the היה clauses use unanchored predicators slightly
more often (60% versus 54%) and use anchored predicators slightly less often (40% versus 46%).
Explicit subjects are more than half-again more frequent, while direct objects are nonexistent.
Indirect objects are 60% as frequent. The noun-verb participle does not occur. New to the party are
possessors, comparisons, time points, and beneficiaries. The possessor semantic role occurs more
than 10 times as frequently with היה as with any of the other clauses of Biblical Hebrew.
Comparisons are 3 times more frequent. Time points are almost twice as frequent. Beneficiaries are nearly
twice as frequent.
[Bar chart: CIC Incidence for היה Clauses. Vertical axis: 0 to 100. Categories: anchored verb, inf. construct, n-v participle, unanchored verb, verb participle, subject, direct object, indirect object, subj+obj compl, location, negation, movement aim, movement orig, possessor, comparison, time point, beneficiary.]


13.6 Core Constituent Ordering 


13.6.1 Core Patterns for the היה Corpus


We plot bar graphs for היה clauses with anchored and unanchored predicators. The procedures
for extracting the core patterns were specified in §11.3.2.2.


[Bar graphs: היה Core Patterns, Anchored and Unanchored. Horizontal axis: 0 to 60. Patterns plotted: V, VS, VO, VI, VC, SV, OV, IV, CV, VSO, VSI, VSC, VOS, VOI, VOC, VIS, VIO, SVO, OVS, other.]
The table shows the sequence patterns for both anchored and unanchored predicators.


Predicator Type | Rank #1 | Rank #2 | Rank #3 | Rank #4 | Rank #5 | Rank #6
Anchored | VS 29% | VC 24% | VSC 24% | V 13% | VSI 7% | VCS 1%
Unanchored | VS 23% | V 18% | SV 17% | SVC 11% | VC 7% | CV 7%


13.6.2 SRs Amid GFs 


The numerical entries in the table below disclose what percent of a given SR group is
interpolated amid the GFs of the clauses that compose the היה corpus.¹²


Percent of SR-Groups Interpolated amid the היה GFs

Root | # Qal Active Clauses | Other Participant | Movement and Spatial | Temporal | Manner | Enriching Constituent | Mixed Level and DU
היה | 2,798 | 41.1% | 17.8% | 8.0% | 21.9% | 29.7% | 7.3%


The specifics underlying these figures are: 


* Other Participant. Of the 917 Other Participant tokens, 377 are interpolated, an incidence
of 377/917 or 41.1%.

* Movement and Spatial. Of the 859 Movement and Spatial tokens, 153 are interpolated,
17.8%.

* Temporal. Of the 439 Temporal SR tokens, 35 are interpolated, 8.0%.

* Manner. Of the 73 instances of the Manner SR, 16 are interpolated, 21.9%.

* Enriching Constituent. Of the 155 Enriching Constituent tokens, 46 are interpolated,
29.7%.

* Mixed Level and Discourse Unit. Of 524 aim and comparison tokens, 38 are interpolated,
7.3%.


We count 665 interpolations. The “interpolation rate” is 238 interpolations per 1,000 היה clauses.
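
The per-group percentages and the overall interpolation rate are simple ratios, and can be recomputed from the token counts listed above. The following is a minimal sketch of that calculation, not the authors' own tooling; the names `GROUPS`, `group_percentages`, and `interpolation_rate` are invented for the illustration.

```python
# (interpolated tokens, total tokens) for each SR group, copied from the list above.
GROUPS = {
    "Other Participant": (377, 917),
    "Movement and Spatial": (153, 859),
    "Temporal": (35, 439),
    "Manner": (16, 73),
    "Enriching Constituent": (46, 155),
    "Mixed Level and Discourse Unit": (38, 524),
}
CLAUSES = 2798  # single-predicator clauses in this corpus


def group_percentages(groups):
    """Percent of each group's tokens that are interpolated, to one decimal place."""
    return {name: round(100 * hit / total, 1) for name, (hit, total) in groups.items()}


def interpolation_rate(groups, clauses, per=1000):
    """Return (total interpolations, interpolations per `per` clauses, rounded)."""
    interpolated = sum(hit for hit, _ in groups.values())
    return interpolated, round(per * interpolated / clauses)


percentages = group_percentages(GROUPS)
interpolated, rate = interpolation_rate(GROUPS, CLAUSES)

print(percentages)
print(interpolated, rate)
```

Normalizing to a rate per 1,000 clauses is what makes the figure comparable across corpora of different sizes.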


13.7 Brief Summary 


We parse היה predicators as: single predicators in clauses (79%), discourse transition signalers
(17%), and auxiliary verbs in periphrastics (4%).

The Censuses. The predicators of היה are overwhelmingly Qal actives (99.5%). When they
are the sole predicators in clauses, most are finite verbs (95%), with infinitives construct next most
likely (5%). Non-GF/SR CICs are uncommon (in 12% of clauses), with negatives being the most
frequent (in 9% of the clauses and immediately pre-predicator in 94% of occurrences). Regarding
GFs, explicit subjects are in 65% of the clauses and subject complements in 43%. SRs are
common. The two most frequent (location and possessor) are in 45% of the clauses. Adverbs are SRs
in 6% of the היה corpus. The most common prepositions, ל and ב, account for two-thirds of the PP
SRs. The preposition ל appears in possessor (36%), subject complement (27%), and “other” (37%)
SRs. The preposition ב appears in locations (64%) and in time points (26%). Bare substantives are
mainly subjects (67%) and subject complements (28%).

CIC Incidence. היה clauses use unanchored verbs slightly more often and anchored verbs
slightly less than is the case across Biblical Hebrew. Relative to all of Biblical Hebrew, explicit
subjects are 1.7 times more frequent, possessors are 10 times more frequent, comparisons 3 times
more frequent, and time points and beneficiaries approximately 2 times more frequent.

Core Patterns. Of the 6 most frequent sequences in both anchored and unanchored היה
predicators, only 2 are not predicator initial (unanchored SV, 16%, and unanchored SVC, 10%). For the
anchored cases, the top 3 dominant sequences are VS, followed by VC and then VSC. For the
unanchored cases, the top 3 dominant sequences are VS, followed by V and then SV. Intrusions of SRs
into the היה clause cores are quite frequent, occurring at a rate of 238 intrusions per 1,000 clauses,
32 times more frequent than the rate for אמר.


12. Readers wishing to be reminded of the way we group the SRs should look inside the back cover.


Chapter 14


The עשה Corpus


14.1 The Binyan Census


There are 2,611 predicators with the root עשה in Biblical Hebrew. Of these, 80 are elliptic,
leaving a total of 2,531 actual predicators. These 2,531 instances involve 2,416 Qal actives,
15 Qal passives, 98 Niphal passives, and two Piel actives.¹ In this chapter, we study only the Qal
actives.
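
The binyan census is internally consistent: subtracting the elliptic predicators from the grand total leaves exactly the sum of the four binyan counts. A small sketch of that bookkeeping follows; the variable and function names are illustrative only.

```python
TOTAL_PREDICATORS = 2611  # all predicators with this root
ELLIPTIC = 80             # elliptic predicators, set aside

# Binyan counts as given in the census paragraph.
BINYANIM = {
    "Qal active": 2416,
    "Qal passive": 15,
    "Niphal passive": 98,
    "Piel active": 2,
}


def census_shares(binyanim):
    """Whole-number percentage of the actual predicators in each binyan."""
    actual_total = sum(binyanim.values())
    return {name: round(100 * n / actual_total) for name, n in binyanim.items()}


actual = TOTAL_PREDICATORS - ELLIPTIC
census_is_consistent = sum(BINYANIM.values()) == actual
binyan_shares = census_shares(BINYANIM)

print(actual, census_is_consistent)
print(binyan_shares)
```

The resulting Qal active and Niphal shares are the ones quoted in the chapter's brief summary.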


14.2 The CIC Subtype Census 


The distribution of the CICs for the 2,416 Qal active עשה clauses is shown in table 14.1.


Table 14.1. CIC Subtype Census for the עשה Corpus: 2,416 Structures


Impermanents Count Predicators Count GFs 
Underspecified “and” Finite verb 1,966 | Subject 
Lapsus calami 2 | Predicative inf. abs. 5 | Direct object 1,484 
Nebulous 2 | Purely verbal participle 97 | Indirect object 359 

Syntactic Isolates | Count | Noun-verb participle 
Vocative 20 | Infinitive construct 297 
Exclamative 4 | Non-predicative inf. abs. 11 

Label 5 Operators 
Negative 143 | Manner 


Complement 


Top Six SRs 
Location 


Comparison 


Closed interrogative 13 | Beneficiary 


Other conjunction 12 | Time point 118 
Modal 9 | Aim/purpose 87 


14.3 Survey of the Non-GF/SR Clause Immediate Constituents 


§14.3 Main Observations: (1) Non-GF/SR CICs are infrequent in the עשה corpus.
They appear in 8% of the clauses, and if the negatives are excluded, in just over 2.5%.
(2) The negative CICs usually immediately precede their predicator (88%).


1. Two Piels, in Ezek 23:3 and 8, having the sense of ‘press, squeeze (unchaste act)’ (BDB), are omitted.

14.3.1 Impermanents

Underspecified “and” is omitted from the analysis, as are two lapsii calami (in 2 Kgs 19:31 and
Ezek 9:11), both Qere welo’ Kethivs, and two nebulous CICs, both in 2 Sam 7:23.
14.3.2 Syntactic Isolates 


14.3.2.1 Vocatives 
Of the 20 vocatives, 6 are clause initial, 7 clause final. There are 9 in the Latter Prophets. 


14.3.2.2 Exclamatives 

Clause-initial אהה occurs before vocatives in 2 Kgs 6:15 and Ezek 11:13.
14.3.2.3 Labels

There are 5 set phrases, variants of נאם־יהוה ‘oracle of Yahweh’. They are clause terminal 4
times (Jer 19:12; Ezek 12:25, 36:32, 37:14) and clause medial once (Jer 5:18).
14.3.3 Predicators

Most predicators in the עשה corpus are finite verbs (82%), with infinitives construct
second-most frequent (12%). There are 10 cognate-root infinitives absolute (intensifiers) in the עשה
clauses. There is 1 non-cognate root infinitive absolute (amplifier), in Qoh 12:12, involving הרבה
‘to increase’.


14.3.4 Operators 


14.3.4.1 Negatives 
Of 143 negatives, 126 immediately precede their predicators (88%). 


14.3.4.2 Closed Interrogatives 
There are 13 closed interrogatives.²


14.3.4.3 Other Conjunctions 

There are 12 noncoordinating conjunctions. Nine are גם/וגם ‘also’ and 3 אף ‘also’.
14.3.4.4 Modals 

There are these 9 modals: ^3 ‘surely’ 3x, °218 ‘perhaps’ 4x, and 728 ‘surely’ 2x. 


14.4 Survey of the GF/SR Clause Immediate Constituents 


§14.4 Main Observations: (1) Adverbs are few, with כן ‘thus’ predominating (168x).
(2) The identity of one PP CIC is determined by the semantics of its substantives, time.
(3) The ל-phrases are divided among the indirect object (47%), beneficiary (26%), and
“other” SRs (27%). (4) Excluding time noun phrases, two-thirds of ב-phrases involve
locations. (5) Of the bare substantives, 33% are subjects and 54% are direct objects.


2. In addition, there are 10 phrasal open interrogatives: למה ‘to what’ = ‘why?’ (4x), עד מתי ‘until when’ =
‘how long?’ (3x), במה ‘in what?’ (2x), and תחת מה ‘under what?’ (1x).

This subsection investigates linkages between CIC functions (given by GF/SR labels) and their
forms (given by CIC daughter labels). The CIC daughters’ distribution is shown in table 14.2.


Table 14.2. GF/SR Daughter Forms 


GF/SR Daughter Count 
Adverb 211 
Clause 7 
Prepositional phrase 2,103 
Noun phrase 1,594 
Pronoun 291 


14.4.1 Adverbs 

The 211 adverbs are subdivided into 3 subsets on the basis of semantics. We distinguish adverbs
having temporal semantics (temporal adverbs), adverbs having spatial semantics (spatial adverbs),
and certain other adverbs (adverbs of manner). Here are the adverbs in עשה clauses:


Temporal Adverbs 
(tm pt, tm int, # times) 
Tip still [tm int] 4 
nnp now 4 

| 
l 
2 


Spatial Adverbs 
(loc) 


Adverbs of Manner 
(mnr) 


NI [emphatic] 5 
xia then 2 
Y» together 
okna suddenly 


nwbn farther? 
733 already 
Tip again [# times] 


p | — 


a. Usually spatial, in Ezek 43:27, the sense has “leaked” into the temporal domain. 


14.4.2 Clauses 

There are 7 cognitive complements, each involving questioning: Gen 20:9, Exod 14:5, Num 
22:28, 1 Sam 20:1, and Jer 4:30 (3x). See §8.2.2. Phrase marker (14.1) from Exod 14:5 illustrates 
(p. 199). 


14.4.3 Prepositional Phrases 


The עשה clauses contain 2,118 prepositional phrases involving 16 prepositions.

(14.1) [Phrase marker, Exod 14:5: ‘what this we made, that we sent Israel from serve us’; the clause introduced by ‘that’ is the cogv cmpl CIC.]


14.4.3.1 A Temporal Substantive in a PP Is Part of a Time SR

The preposition usually identifies the GF/SR of a prepositional phrase, but one of the
substantive categories is determinative (temporal semantics). Prepositional phrases involving nouns having
temporal semantics occur 92 times. In all cases,³ their dominating CICs involve time.
14.4.3.2 The ל Prepositional Phrases

Prepositional phrases involving ל occur 727 times. The most common categories are indirect
object (ind obj) and beneficiary (benf), nearly three-quarters of the prepositional phrases that are
CICs.


Table 14.3. ל-Phrase Substantive versus GFs/SRs in the עשה Corpus


It is important to understand the natures of these two categories. The first (ind obj) is a GF,
while the second (benf) is an SR. They are not mutually exclusive. Indeed, in a full
representation of the grammatical structure of a clause (§9.2.2), a constituent may concurrently have a GF
of indirect object and an SR of beneficiary, as in phrase marker (14.2) from 1 Sam 19:5. The final
CIC is correctly assigned the GF of indirect object. But from a semantic perspective, the
constituent may well have the SR of beneficiary. Indeed, we classify the constituent לכל־ישראל ‘to all of
Israel’ as a beneficiary in phrase marker (14.3) from 1 Chr 29:21 (p. 201). This raises an important
question: did the human over-reader adhere to the mixed-representation policy of applying a GF
category wherever possible in preference to the SR? Since our awareness of the distinction between
grammatical and semantic functions only emerged after we were well into the process of
assigning categories, the candid admission must be made that the policy could not have been adhered to
strictly. This fact makes it all the more important that we implement the full representation as soon
as possible.


3. The semantic roles of 3 temporal CICs are masked by the fact that, because of our mixed representation strategy,
they are labeled direct objects: Exod 31:16, Deut 5:15, and Esth 9:19.


(14.2) [Phrase marker, 1 Sam 19:5: ‘he made Yahweh salvation big to all of Israel’; the final CIC, ‘to all of Israel’ (to + humn), is labeled ind obj.]


14.4.3.3 The ב Prepositional Phrases

Prepositional phrases involving ב occur 436 times in עשה Qal active clauses. They are linked
to location SRs 259 times (59.4%) and to time SRs 70 times (16.1%). Hence, these 2 SRs account
for almost 76% of the instances of ב PPs in עשה clauses.


Table 14.4. '3-Phrase Substantive versus GFs/SRs 


substantive type 
l loc 


body part 103 
geographic 


[other] 


temporal 46 
[other] 95 24 101 
Total 


This is a good place to point out a very common situation, in which information from a nearby
clause is required in order properly to assign a semantic role. How does one determine the roles
associated with PPs involving pronouns? Our database does not yet encode chains of referential
cohesion. That is, anaphors and cataphors are not yet indicated. Hence, in an embedded obliqueness
structure such as in (14.4) from Exod 35:2 (p. 202), the SR of בו ‘in him’ is not accessible from the
clause context.
(14.3) [Phrase marker, 1 Chr 29:21: ‘and they(m) brought up holocausts to Yahweh to tomorrow of the day the he: bulls thousand, rams thousand, lambs thousand, and libations them(m), and sacrifices to multitude to all of Israel’. CIC labels include obj cmp, ind obj (to + deity), tm aim (to + time), dir obj, and benf (to + humn).]
(14.4) [Phrase marker, Exod 35:2: nominalized participle ‘making in him work’, with the PP ‘in him’ (in + pron) and the dir obj ‘work’.]


Clearly, we must work out what the anaphor (or cataphor) of ‘him’ is. To do this, we require access
to the clause before the clause in which the nominalized participle is embedded. There, we see that
the anaphor of ‘him’ is יום השביעי ‘the day the seventh’. This being the anaphor, we infer that בו
must have a semantic role involving time.
14.4.3.4 Nearly Invariant Prepositional Phrases 

Some prepositions are predominantly associated with a single GF or SR, with very few
exceptions. These are: את ‘(object marker)’ with 99.7% dir obj,⁴ כ ‘like’ with 97.4% cmpr,⁵ and עם
‘with’ with 89.5% accmp. In table 14.5, we list the prepositions and their preponderant substantive
objects.


Table 14.5. Nearly Invariant Preposition GFs /SRs 


substantive type 


cmpr accmp 


[pronoun] 
human 19 12 24 
[nominalized] 
vocable 66 46 
[other] 
Total 380 308 86 


14.4.3.5 The Remaining Prepositional Phrases

To this point, we have presented the prepositional phrases associated with 5 different
prepositions. These account for 91.4% of the PPs. Left for specification are 181 PPs, involving 5 “major”
prepositions plus a miscellany of 6. We give their tallies in a condensed format:


1. מן: There are 67 PPs involving מן and its variants. Nearly half (28, 42%) involve
position: location (loc) 5x, movement interval (mvt int) 1x, or movement origin (mvt orig)
22x. The rest are scattered across a surprising 11 CIC labels (this scattering is quite similar
to what we found in the היה clauses).

2. על: There are 56 PPs involving על and variants. Most (46, 82%) involve location (loc).
The other 10 involve rsn (5x), accmp (1x), refrnt (3x), and rslt (1x).


4. There is a subject with an object marker in Neh 9:34. 
5. Seven out of the 8 noncomparatives are "approximate time" specifiers. 
3. למען: There are 16 instances of prepositional phrases involving למען. They appear as
reason (8x), result (3x), beneficiary (3x), and aim (2x).

4. עד: There are 11 instances of prepositional phrases involving עד.

5. תחת: There are 9 instances of prepositional phrases involving תחת.

6. Miscellaneous. The other 6 prepositions are: אל ‘unto’ (3x), אחר ‘afterward’ (2x), בין
‘between’ (7x), בעד ‘through’ (1x), בעבור ‘for the sake of’ (6x), לבד ‘except’ (3x).


14.4.4 Substantives 
The final constituents to be tabulated are 1,885 bare substantives in the Qal active עשה clauses.
Approximately two-thirds of the clauses contain one or more bare substantives.


Table 14.6. Bare Substantives in the nwy Corpus 


substantive type 


sbj dir obj [other] 
human 316 38 n 
quality 8 140 6 
deity 102 17 | 2 
pronoun 6 


utensil 3 
[other] 54 
Total 629 


14.5 Clause Immediate Constituent Incidence Contours 
Below is the bar graph of CIC incidence for the Qal active עשה corpus.


[Bar graph: CIC Incidence for Qal עשה Clauses.]

With Qal active עשה, unanchored finite verbs are preferred. An explicit subject occurs in one-third
of the clauses, a direct object in nearly two-thirds, a location in about 15%, and movement rarely.


14.6 Core Constituent Ordering 


14.6.1 Core Patterns for the Qal Active nwy Corpus 


There are 940 עשה clauses for which the predicator is anchored. Of the 1,476 clauses with
unanchored predicators, 853 are clause initial, and 623 are clause medial or final. Hence, 58% of the
unanchored predicators are clause initial.
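
The 58% figure is the clause-initial count divided by all unanchored predicators, and the two predicator classes together should account for every Qal active clause in the corpus. A minimal sketch of that check, with names chosen here purely for illustration:

```python
ANCHORED = 940
UNANCHORED_INITIAL = 853      # unanchored predicators in clause-initial position
UNANCHORED_NON_INITIAL = 623  # unanchored predicators that are clause medial or final


def position_summary(anchored, initial, non_initial):
    """Return (unanchored total, corpus total, percent of unanchored that are initial)."""
    unanchored = initial + non_initial
    corpus_total = anchored + unanchored
    initial_share = round(100 * initial / unanchored)
    return unanchored, corpus_total, initial_share


unanchored, corpus_total, initial_share = position_summary(
    ANCHORED, UNANCHORED_INITIAL, UNANCHORED_NON_INITIAL
)
print(unanchored, corpus_total, initial_share)
```

The corpus total recovered this way matches the number of Qal active structures reported in the census.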


[Bar graphs: עשה Core Patterns, Anchored and Unanchored. Horizontal axis: 0 to 60.]


The table shows the sequence patterns for both anchored and unanchored predicators.


Predicator Type | Clauses | Rank #1 | Rank #2 | Rank #3 | Rank #4 | Rank #5 | Rank #6
Anchored | 940 | VO 54% | V 15% | VS 8% | VSO 7% | VI 4% | VOC 3%
Unanchored | 1,476 | V | | | | |


For the anchored predicators, the majority of the clauses have the VO core pattern (54%) with very
few OV patterns (0.3%). Instances of the VSO pattern significantly outnumber those with the SVO
pattern (7% versus 0.3%). For the unanchored predicators, many of the clauses have the VO core
pattern (17%), but there also are many with the OV pattern (13%). Instances of the VSO pattern lag
somewhat behind those with the SVO pattern (3% versus 6%).
14.6.2 SRs amid GFs in the Qal Active עשה Corpus
SRs are interpolated amid the GFs in the עשה corpus as shown in the table.⁶


Percent of SR-Groups Interpolated amid the עשה GFs

Root | # Qal Active Clauses | Other Participant | Movement and Spatial | Temporal | Manner | Enriching Constituent | Mixed Level and DU
עשה | 2,396 | 48.5% | 12.1% | 10.2% | 16.4% | 8.1% | 2.6%


The specifics underlying these figures are: 


* Other Participant. Of the 295 Other Participant tokens, 143 are interpolated, an incidence
of 143/295 or 48.5% (beneficiaries account for most of the intrusion).

* Movement and Spatial. Of the 414 Movement and Spatial tokens, 50 are interpolated,
12.1% (location CICs account for the vast majority).

* Temporal. Of the 196 Temporal SR tokens, 20 are interpolated, 10.2%.

* Manner. Of the 201 instances of the manner SR, 33 are interpolated, 16.4%.

* Enriching Constituent. Of the 160 Enriching Constituent tokens, 13 are interpolated,
8.1%.

* Mixed Level and Discourse Unit. Of 426 aim and comparison tokens, 11 are interpolated,
2.6%.


We count 270 interpolations. The “interpolation rate” is 113 interpolations per 1,000 עשה clauses.


14.7 Brief Summary 


The Censuses. The predicators of ny are predominantly Qal actives (95%), there being a re- 
siduum of Niphal passives (496). Most predicators are finite verbs (8196), with infinitives construct 
the next most common predicator (12%). Non-GF/SR CICs are infrequent (in 9% of clauses), 
with negatives being the most common (in 6% of the clauses). Regarding GFs, explicit subjects 
appear in 26% of the clauses, direct objects in 61%, and indirect objects in 15%, mostly involving 
5 (95%). The adverb of manner 72 ‘thus’ occurs in 796 of the clauses. Three prepositions account 
for three-quarters of the PP CICs: 5, 3, and nx. The bare substantives are mainly direct objects 
(5496) or subjects (3396). Bare substantives are usually subjects when they have human semantics 
(88%). Bare substantives are often direct objects when they have utensil semantics (71%) or quality 
semantics (91%). 

CIC Incidence. Comparing relative frequencies: NWy clauses use unanchored verbs a bit more 
often and anchored verbs a bit less often than is the case across Biblical Hebrew; there are about 
70% as many subjects, about one-quarter more direct objects, and 40% more indirect objects. 

Core Patterns. For anchored nivy predicators, the majority of the clauses have the VO core 
pattern (54%). The VSO pattern is significantly more frequent than the SVO pattern (7% versus 
0.396). For the unanchored predicators, the VO core pattern is not uncommon (17%), but OV is also 
not uncommon (13%). 


6. Note: the number of clauses being considered is reduced by 20 clauses because we have excluded clauses with 
compound predicators to simplify the discussion. 
With 113 SR intrusions per 1,000 clauses, the עשה corpus lies midway between the sparse
intrusion occurring in the אמר corpus (7.4/1,000) and the rampant intrusion found in the היה corpus
(238/1,000).


Chapter 15 


The נתן Corpus


15.1 The Binyan Census


There are 1,992 instances of the root נתן: 1,896 Qal actives, 8 Qal passives, 7 Piel passives, and
81 Niphal passives. We characterize the Qal actives.


15.2 The CIC Subtype Census 


The 1,896 clauses with Qal active נתן exhibit the distribution of CICs shown in table 15.1.


Table 15.1. CIC Subtype Census for the נתן Corpus: 1,896 Structures


Impermanents           Count | Predicators                Count | GFs              Count
Underspecified “and”     703 | Finite verb                1,626 | Subject            520
Lapsus calami                | Predicative inf. abs.            | Direct object    1,521
Nebulous                     | Purely verbal participle         | Indirect object    972
Syntactic Isolates     Count | Noun-verb participle             | Complement         210
Vocative                     | Infinitive construct             | Top Six SRs      Count
Exclamative                  | Non-predicative inf. abs.        | Location           587
Label                     11 | Operators                  Count | Aim/purpose        146
                             | Negative                     109 | Movement aim        95
                             | Closed interrogative           8 | Comparison          83
                             | Other conjunction (a)         12 | Time point          75
                             | Modal                          2 | Movement origin     4-


a. “Other conjunction” includes all CIC-level conjunctions other than conjunctions that are classified as
underspecified “and.”


15.3 Survey of the Non-GF/SR Clause Immediate Constituents 


§15.3 Main Observations: (1) There are relatively few non-GF/SR CICs in the נתן
corpus. (2) Usually the negatives immediately precede the predicators (95.4%). (3) Many
negatives are clause initial or follow an underspecified “and” (69%).

15.3.1 Impermanents
Underspecified “and” is excluded from the analysis. There is one nebulous CIC, in Ezek 27:18. 


15.3.2 Syntactic Isolates 
15.3.2.1 Vocatives 

Of 12 vocatives, 3 are clause initial, 1 postexclamative (1 Kgs 3:26), and 3 clause final. 
15.3.2.2 Exclamatives 

Four exclamatives appear, in 1 Kgs 3:26; Ezek 16:23, 21:20; and Hos 11:8. 


15.3.2.3 Labels 
There are 11 labels, 4 in Jeremiah (all are נאם־יהוה ‘oracle of Yahweh’) and 7 in Ezekiel (all are
נאם אדני יהוה ‘oracle of my Lord Yahweh’).


15.3.3 Predicators 

The finite verbs dominate (86%), with infinitives construct next (8%). Six infinitives absolute
are cognate-root intensifiers, and 2 are noncognate root amplifiers.¹
15.3.4 Operators 


15.3.4.1 Negatives 
Five of 109 negatives do not directly precede their predicators. 


15.3.4.2 Closed Interrogatives 
There are 8 closed interrogatives. 


15.3.4.3 Other Conjunctions 

There are 12 noncoordinating conjunctions. Ten are גם ‘also’, 1 is אף ‘also’ (Ps 89:28), and
1 is אולם ‘but indeed’ (Job 11:5).
15.3.4.4 Modals 

There is a single one-segment modal, אך ‘surely’, in 1 Chr 22:12. More interesting are the
optative modals that may be translated ‘would that’ in Job 6:8. Twenty of the 39 open interrogatives in
the נתן corpus are instances of this modal idiom. Phrase marker (15.1) for Job 6:8 shows how we
have felt forced to handle the idiomatic clause:²


(15.1) [Phrase marker for Job 6:8. Constituent labels: cl, mdl, nom cl, sbj, open intrg, pos.
Glosses: ‘who’, ‘he will give’, ‘let her come’, ‘request’, ‘me’.]


1. The 6 intensifiers are in Num 21:2, 27:7; Deut 15:10; Judg 8:25, 11:30; and 2 Sam 5:19. The 2 amplifiers are in
1 Kgs 5:9 and 2 Chr 25:9.

2. See C. van der Merwe, J. A. Naudé, and J. H. Kroeze, Biblical Hebrew Grammar, 323, where the idiom is
termed an interjection; also B. Jongeling, “L’expression my ytn dans l’Ancien Testament,” VT 24 (1974) 32-40.

In the clause, מי־יתן is an embedded clause that we have marked as nominalized and called a modal.


15.4 Survey of the GF/SR Clause Immediate Constituents 


§15.4 Main Observations: (1) No adverb predominates in the נתן corpus. (2) The
נתן corpus exhibits a mild alternation in realizing indirect objects: ל (91.4%) versus אל
(4.3%). (3) Three prepositions account for 81% of the PP CICs: ל, את, and ב. (4) A
bare substantive is almost twice as likely to be a direct object (964x) as to be a subject
(493x).


This subsection investigates linkages between CIC functions (CIC GF/SR labels) and their forms 
(CIC daughter labels). The CIC daughters census is shown in the table. 


Table 15.2. GF/SR Daughter Forms 
GF/SR Daughter 


Adverb 41 
Clause(-like) struct. 2 
Prepositional phrase 


Noun phrase 1,272
Pronoun 


15.4.1 Adverbs 
The 41 adverbs are subdivided into temporal adverbs, spatial adverbs, and certain other adverbs
(“adverbs of manner”). Here are the adverbs in נתן clauses:


Temporal Adverbs Spatial Adverbs Adverbs of Manner 
(tm pt) (mnr) 
TM then 2 min here 3 thus 
Tip again (# times) | 3 ow there | 1 (emphatic) 
nnp now | 6 maw thither | Ni very 


15.4.2 Clause(-Like) Structures 
There are no clause-like structures. The 20 cases of מי־יתן are not CIC daughters in נתן clauses.


15.4.3 Prepositional Phrases 
The נתן clauses contain 2,763 CIC daughter prepositional phrases involving 19 prepositions:


1,262  ל
  516  את
  464  ב
  194  על
  100  מן
   85  כ
   75  אל
   16  תחת
   16  עד
   10  עם
   26  Misc. (a)


a. Eight different prepositions make up this miscellany.
15.4.3.1 Nearly Invariant Prepositional Phrases
The preposition identifies the GF/SR of the PP in 3 instances. All CIC-daughter PPs:
* with את ‘(obj marker)’ (516x) are direct objects, except 1 in 1 Kgs 11:35 (obj cmp);
* with כ ‘like’ (85x) are comparisons, except in Exod 31:18 (tm pt) and Gen 42:30 (obj cmp);
* with עם ‘with’ (10x) are accompaniers.
The other PPs manifest 36 GFs/SRs.


15.4.3.2 Prepositional Phrases with Temporal Semantics 

Prepositional phrases with temporal nouns occur 59 times. Their dominating CICs involve time 
(47x). Of 12 nontemporal CICs, 8 are comparisons involving time. For example, Jer 11:5 has:
לתת להם ארץ זבת חלב ודבש כיום הזה, which NJPS translates ‘to give them a land flowing with milk and
honey, as is now the case’.
15.4.3.3 The ל Prepositional Phrases

The 1,262 CIC daughter phrases involving ל have the distribution shown in table 15.3.


Table 15.3. ל-Phrase Substantive versus GFs/SRs


substantive type 
ind obj aim obj cmp | [other] 


[pronoun] 498 5 
human 33 32 
other infinitive construct 7 
[other] 69 87 
Total 888 141 102 131 


Clearly ל + [pronoun] or ל + human mostly link to an indirect object GF (91%), while ל +
infinitive construct almost always links to an aim/purpose SR (97%).

15.4.3.4 The ב Prepositional Phrases

PPs involving ב occur 464 times in נתן clauses (see table 15.4).


Table 15.4. ב-Phrase Substantive versus GFs/SRs


substantive type 
[other] 


body part 
geographic 


human 


temporal 28 2 
(other) 67 14 52 
Total 42 87 


The prefix ב + geographic usually links to a location SR, as does ב + body part.

15.4.3.5 The על Prepositional Phrases
Prepositional phrases involving על occur 194 times in נתן clauses (table 15.5).


Table 15.5. על-Phrase Substantive versus GFs/SRs


substantive type 

[other] 
body part 7 
(pronoun) 6 
human 2 
geographic I 
(other) 6 
Total 22 


The על phrases are linked to location SRs 156 times (80.4%) and to indirect objects 16 times
(8.2%), accounting for almost 89% of the instances of the על PPs in נתן clauses.


15.4.3.6 The Remaining Prepositional Phrases 

To this point, we have presented the prepositional phrases associated with 6 different
prepositions. Together these account for 95.6% of the total number of CIC-daughter prepositional phrases
in נתן clauses. Remaining for GF/SR specification are 122 prepositional phrases, involving 4
“major” prepositions plus a miscellany of 26 more. We give their tallies in a condensed format.


1. מן: There are 100 prepositional phrases involving מן and its variants. Over half (54)
involve position: location (loc) 10x and movement origin (mvt orig) 44x. The rest are
scattered across 11 CIC labels (this scattering is quite similar to that of היה clauses).

2. אל: There are 75 instances of prepositional phrases involving אל. There are 42 indirect
objects, 27 involving movement, plus 6 others.

3. תחת: There are 16 instances of prepositional phrases involving תחת. They appear as
location (6x) and surrogate (10x).

4. עד: There are 16 instances of prepositional phrases involving עד. Nine are time aims, 6
are movement aims, and 1 (Deut 2:5) is an area!

5. Miscellaneous. We list the 8 prepositions with their counts (or references): 


pa ‘between’ (9x) 553 ‘on account of’ (1 Kgs 14:16, Jer 15:4) 
wn? ‘for (5x) 71) ‘except’ (Gen 32:17, 2 Sam 20:21) 
"ins ‘after’ (4x) nx ‘afterwards’ (Num 6:17) 


p ‘near’ (2 Kgs 12:10, Ezek 43:8) va ‘through’ (Job 2:4) 


15.4.4 Substantives 
The final constituents to be discussed are the 1,640 “bare” substantives.³ Of the single-predicator
נתן clauses, 1,388 (69%) contain at least 1 bare substantive.


3. That is, substantives that are not part of prepositional phrases. 
Table 15.6. Substantives for the נתן Corpus


substantive type | sbj | dir obj | [other]
human            |     |         |
deity            |     |         |
[pronoun]        |     |         |
geographic       |     |         |
substance        |     |         |
body part        |     |         |
abstract         |     |         |
food             |     |         |
vocable          |     |         |
[other]          |     |         |
Total            | 493 |     964 |     183


We observe that in the נתן corpus, a bare substantive is twice as likely to be a direct object as to be
a subject (see table 15.6). If the substantive has deity semantics, it is a subject (99.5%). If a
substantive is not a pronoun or has semantics other than human or deity, then it is often a direct object
(76.5%).
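
The proportions quoted for bare substantives can be rechecked from the three column totals that survive in table 15.6. The following is only a minimal sketch: the dictionary keys are labels chosen here for illustration, and it assumes that the totals 493, 964, and 183 belong to subjects, direct objects, and all remaining functions, in that order.

```python
# Column totals of table 15.6 (bare substantives in the Qal active corpus).
# Assumption: the totals map to subject, direct object, and "other", in that order.
totals = {"subject": 493, "direct object": 964, "other": 183}


def shares(counts):
    """Return each category's percentage of the grand total, rounded to 0.1."""
    grand = sum(counts.values())
    return {name: round(100 * n / grand, 1) for name, n in counts.items()}


grand_total = sum(totals.values())
object_to_subject = round(totals["direct object"] / totals["subject"], 2)

print(grand_total)        # total number of bare substantives
print(shares(totals))     # percentage per function
print(object_to_subject)  # direct objects per subject
```

Under these assumptions the grand total agrees with the 1,640 bare substantives reported in §15.4.4, and the ratio of direct objects to subjects comes out just under two.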


15.5 Clause Immediate Constituent Incidence Contours 


Consider the bar chart for the clauses making up the נתן cohort.


[Bar chart: CIC Incidence for Active נתן Clauses. Vertical axis: 0 to 100. Categories: anchored verb,
inf. construct, n-v participle, verb participle, direct object, indirect object, subject,
subj/obj compl., location, negation, movement aim, movement origin, aim/purpose.]
The incidence of the various kinds of predicators is very similar to the incidence for the average
clause. Across Biblical Hebrew, we find 46% of predicators are anchored and 54% are unanchored;
for the נתן corpus, the percentages are 45% and 55%. Comparing relative frequencies, there are
about three-quarters as many subjects, about three-quarters more direct objects, and nearly 5 times
more indirect objects. There are two-thirds more location SRs and somewhat fewer movement
origins and movement aims (12% versus 7%).


15.6 Core Constituents and Their Ordering 


15.6.1 Core Patterns for the נתן Corpus
We plot the horizontal bar charts for נתן clauses with anchored and unanchored predicators.⁴


[Two horizontal bar charts: “Core Patterns. Anchored” and “Core Patterns. Unanchored.” Legible row
labels include V, VS, VO, VC, SV, OV, IV, CV, VSO, VSI, VSC, VOS, VOI, VOC, SVO, and OVS.
Horizontal axis: 0 to 60.]


The table shows the sequence patterns for both anchored and unanchored predicators. 


Predicator Type | Rank #1 | Rank #2 | Rank #3 | Rank #4 | Rank #5 | Rank #6
Anchored        | VO 33%  | VIO 15% | VOI 7%  | VI 7%   | VOC 6%  | VSO 5%
Unanchored      | VO 15%  | VIO 8%  | OVI 8%  |         | 6%      |


These are spread-out distributions. For the anchored predicators, the “other” category includes 25% 
of the clauses. Its top few contributors are V, 5%; VSIO, 3%; VSOI, 2%; VISO, 2%; and VOIC, 
1%. For the unanchored cases, the “other” category is very large, 51%. It scatters across many CIC 
sequence patterns. Its top 4 contributors are SVO, 4%; SVI, 4%; V, 4%; IVO, 4%. Note that almost 
62% of the clauses with unanchored predicators are not predicator initial. 
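
How such sequence patterns and the “not predicator initial” share can be tallied may be sketched as follows. This is not the extraction procedure of §11.3.2.2, which is not reproduced here; the one-letter codes, the function names, and the toy clauses are all hypothetical, and the sketch assumes that “predicator initial” is judged within the core (so a clause-initial conjunction or negative does not count).

```python
from collections import Counter

# Hypothetical one-letter codes for the core constituents:
# V = predicator, S = subject, O = direct object, I = indirect object, C = complement.
CORE_LABELS = {"V", "S", "O", "I", "C"}


def core_pattern(cic_labels):
    """Keep only core constituents, preserving their left-to-right order."""
    return "".join(label for label in cic_labels if label in CORE_LABELS)


def pattern_percentages(clauses):
    """Percentage of clauses showing each core pattern, most frequent first."""
    counts = Counter(core_pattern(clause) for clause in clauses)
    total = sum(counts.values())
    return {pattern: 100 * n / total for pattern, n in counts.most_common()}


def percent_not_predicator_initial(clauses):
    """Percentage of clauses whose core pattern does not begin with V."""
    patterns = [core_pattern(clause) for clause in clauses]
    return 100 * sum(not p.startswith("V") for p in patterns) / len(patterns)


# Toy clauses, invented for illustration and not taken from the corpus.
toy_clauses = [
    ["and", "V", "O"],
    ["V", "I", "O", "loc"],
    ["S", "V", "O"],
    ["neg", "V", "O"],
]
print(pattern_percentages(toy_clauses))
print(percent_not_predicator_initial(toy_clauses))
```

Non-core labels such as "and", "neg", and "loc" are simply dropped before the pattern is formed, which is one possible reading of what a “core pattern” is.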


4. The procedures for extracting the core patterns were specified in §11.3.2.2. 
15.6.2 SRs amid GFs in the Qal Active נתן Corpus
Peripheral constituents interpose into the GF cores of clauses having נתן as predicators fairly
frequently, as is shown in the table:


Percent of SR-Groups Interpolated amid the נתן GFs

     | # Qal Active | Other       | Movement    |          |        | Enriching   | Mixed Level
Root | Clauses      | Participant | and Spatial | Temporal | Manner | Constituent | and DU
נתן  | 1,896        | 19.6%       | 15.1%       | 15.5%    | 35.0%  | 10.0%       | 4.0%


The specifics underlying these figures are: 


* Other Participant. Of the 51 tokens, 10 are interpolated, an incidence of 10/51 or 19.6%.

* Movement and Spatial. Of the 710 tokens, 107 are interpolated, 15.1%.

* Temporal. Of the 103 tokens, 16 are interpolated, 15.5%.

* Manner. Of the 20 instances of the Manner SR, 7 are interpolated, 35.0%.

* Enriching Constituent. Of the 80 Enriching Constituent tokens, 8 are interpolated, 10.0%.

* Mixed Level and Discourse Unit. Of the 249 Mixed Level and DU tokens, 10 are
interpolated, 4.0%.


We count 158 interpolations. The “interpolation rate” is 83.3 interpolations per 1,000. We see that
several kinds of adjunct significantly interpose into clause cores in the נתן corpus and that the
interpolation rate is closer to the rate found for עשה than the rate found for אמר or היה.
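
The group percentages and the overall rate can be rechecked from the counts listed above. The sketch below assumes that the rate is taken per 1,000 of the 1,896 Qal active clauses; that denominator is an inference from the reported figure of 83.3, not something the text states outright, and the variable and function names are illustrative only.

```python
# (tokens, interpolated tokens) per SR group, as listed in the bullets above.
groups = {
    "Other Participant": (51, 10),
    "Movement and Spatial": (710, 107),
    "Temporal": (103, 16),
    "Manner": (20, 7),
    "Enriching Constituent": (80, 8),
    "Mixed Level and Discourse Unit": (249, 10),
}
CLAUSES = 1896  # assumed denominator: the Qal active clauses of the corpus


def group_percentages(groups):
    """Percentage of each group's tokens that are interpolated, rounded to 0.1."""
    return {
        name: round(100 * interpolated / tokens, 1)
        for name, (tokens, interpolated) in groups.items()
    }


def interpolation_rate(groups, clauses):
    """Total interpolations and the rate per 1,000 clauses, rounded to 0.1."""
    total = sum(interpolated for _, interpolated in groups.values())
    return total, round(1000 * total / clauses, 1)


print(group_percentages(groups))
print(interpolation_rate(groups, CLAUSES))
```

With these inputs the six groups sum to the 158 interpolations counted in the text, and the rate per 1,000 clauses matches the reported value.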


15.6.3 The Dative Shift 


At this point, we encounter the dative shift. The dative shift is the much-studied “phenomenon
by which an underlying dative (indirect object) is realized as a direct object, the underlying direct
object being realized as some sort of peripheral element.”⁵ Consider first an example from English.
The clause “Bill gave the book to Mary” (sbj – vb – dir obj – ind obj) can be recast as “Bill gave
Mary the book” (sbj – vb – dir obj – 2nd obj). The indirect object in the first clause is the direct
object in the second clause; the direct object in the first is the second object in the second. This
phenomenon occurs 10 times in the נתן corpus. As an example, compare the “standard” V-I-O phrase
marker (15.2) from Josh 15:19 with shifted-dative O2-V-O phrase marker (15.3) one clause earlier.


(15.2) [Phrase marker for Josh 15:19: vb ‘thou (m) wilt give’; ind obj (to + pron) ‘me’;
dir obj (n:geog) ‘basins of water’.]

5. R. L. Trask, A Dictionary of Grammatical Terms (London: Routledge, 1993) 71. 
(15.3) [Phrase marker for Josh 15:19: 2nd obj (n:geog) ‘land of the Negeb’; vb ‘thou (m) didst give’;
dir obj ‘me’.]


The 10 instances of the dative shift in the נתן corpus are found in Gen 15:10; Josh 15:19; Judg 1:15;
Isa 27:4; Jer 9:1; Ezek 16:7, 38; Job 9:18; Dan 1:9; and Neh 1:11.


15.7 Verb Corpora Overview 


15.7.1 CIC Incidence across Four Qal Active Verb Corpora


The consolidated CIC incidences observed for the four sets of clauses investigated in chaps.
12-15 are documented in table 15.7.


Table 15.7. CIC Incidence Percentages for Four Qal Active Verb Corpora 


CIC 
[anchored vero | 327 | 363 | 355 | 245 | 350 
S | infinitive construct 12.4 8.3 
2 noun-verb participle 2.1 1.2 
s unanchored verb : 56.9 50.7 
verb ae 3, 5 0.1 4.2 4.7 
8 S| | | direct object — direct object 475 61.4 80.2 
© "| sbj/obj complement 5.2 42.9 5.0 11.1 
$ location 15.4 31.0 
E negation 5.9 Dui 
= mvt aim 0.5 5.0 
re 09 | 23 
E 8.4 - 
= | possessor - - 
S B. comparison 12.6 = 
e z time point : = ž 
à ~| beneficiary - - 5.0 8.2 - 
aim (purpose) - - - - 7.7 
The 4 major Qal active verb corpora here studied account for almost one-fifth of the verbal clauses
in Biblical Hebrew. Overall, anchored predicators appear in 46% of these Qal active verbal clauses,
while unanchored predicators appear in 54%. While subjects occur in a bit over one-third of all
verbal clauses (38%) and almost two-thirds of היה clauses (65%), they only appear in one-quarter of
עשה (26%) and נתן (27%) clauses. Half of the verbal clauses in this Qal active corpus contain direct
objects (48%). They are almost always found in אמר clauses (94%) and never found in היה clauses.
Indirect objects occur in around one-tenth of all verbal clauses (11%), one-third of the time in אמר
clauses (37%), and half the time in נתן clauses (51%). Subject and object complements appear in
one-twentieth of Qal active verbal clauses (5%) but four-tenths of היה clauses (43%).


15.7.2 SR Interpolations for Four Verb Corpora
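
Comparisons of this kind reduce to a ratio of two incidence percentages. As a rough sketch, using only percentages quoted in the paragraph above and taking the נתן corpus as the example (the key "ntn" and the function name are illustrative choices, not the authors' terminology):

```python
def relative_incidence(corpus_pct, baseline_pct):
    """How many times as often a CIC occurs in a verb corpus as in the baseline."""
    return corpus_pct / baseline_pct


# Incidence percentages quoted in the text: all verbal clauses versus the ntn ('give') corpus.
baseline = {"subject": 38, "direct object": 48, "indirect object": 11}
ntn = {"subject": 27, "direct object": 80, "indirect object": 51}

ratios = {gf: round(relative_incidence(ntn[gf], baseline[gf]), 2) for gf in baseline}
print(ratios)
```

A ratio below 1 means the constituent is rarer in the verb corpus than in verbal clauses overall, and a ratio above 1 means it is more common.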

Table 15.8 consolidates the CIC interpolation data observed for the four major verb corpora 
studied in chaps. 12-15. The results are ordered by increasing interpolation rate. The results for 
the complete Qal active corpus have also been introduced. (In all cases, clauses with compound 
predicators have been excluded.) 


Table 15.8. CIC Intrusion Percentages for Four Verb Corpora 


Core Percent of SR-Groups Interpolated amid GFs 
# Qal Intrusion Mixed 
Active Rate Movement Enriching | Level & 
(per 1,000) |Participant| & Spatial | Temporal| Manner |Constituent 


0.0% 
An 
m| 1896 | $3 | 196% | 151% | 155% | 35.0% 


2.6% 
7.3% 


The mapping from SR-groups to individual SRs may be found inside the back cover. 
Regarding the phenomenon of SR interpolation, a few observations are in order: 


* Qal active אמר clauses usually have their SRs outside their GF cores.

* The discourse-unit CICs remain outside GF cores to a large extent.

* The “other participant” CIC group appears more likely to be interpolated into a core than
any other group (to judge from the evidence for these four major roots and overall).

* Verbs in Biblical Hebrew lie along a core-interpolation-rate cline. Of the frequently attested
verbs, we conjecture that אמר and היה are at or near the cline extremes.

* Within a given verb corpus, does interpolation exhibit significant variation and, if so, what 
factors influence this variation? 


15.8 Brief Summary 


The Censuses. The predicators of נתן are strongly Qal actives (95.2%). Most predicators are
finite verbs (85.8%), with infinitives construct the next-most common predicator (8.3%).
Non-GF/SR CICs are uncommon (in 8% of clauses) with negatives being the most frequent (in 6% of
the clauses). Usually the negatives immediately precede the predicators (95.4%). There are few
adverbs; they appear in 2% of the clauses. With regard to GFs, direct objects appear in 80% of the
clauses, explicit subjects in 27%, and indirect objects in 51%. With regard to SRs, 3 prepositions
account for 81% of the PP CICs: ל, את, and ב. The prefix ל plus an infinitive construct is almost
always an aim SR (95%), while any other substantive with ל is typically an indirect object (79.4%).
The preposition את essentially always marks a direct object (99.8%). The prefix ב plus a temporal
substantive is usually a time point SR (93.3%), while any other substantive with ב tends to be a
location SR (77.2%). The bare substantives are usually direct objects (59%) or subjects (30%).

CIC Incidence. Across Biblical Hebrew, we find 46% of predicators are anchored and 54% are
unanchored; for the נתן corpus, the percentages are 45% and 55%. There are about three-quarters
as many subjects, about three-quarters more direct objects, and nearly 5 times more indirect objects.
There are two-thirds more location SRs and somewhat fewer movement origins and movement
aims.

Core Patterns. The נתן corpus has one-quarter more unanchored predicators than anchored.
For the anchored predicators, the “other” category includes 25% of the clauses. Its top few
contributors are V, 5%; VSIO, 3%; VSOI, 2%; VISO, 2%; and VOIC, 1%. Among the unanchored
cases, “other” is very large, 51%. It comprises many CIC sequence patterns. Its top 4 contributors
are SVO, 4%; SVI, 4%; V, 4%; IVO, 4%. Note that almost 62% of the clauses with unanchored
predicators are not predicator initial. Intrusions of SRs into the נתן clause cores are fairly frequent,
occurring at a rate of 83.3 intrusions per 1,000 clauses.


Chapter 16 


Makeup of 
Clause Immediate Constituent Subtypes 


In this chapter, we examine the way that each CIC subtype is realized (as a CIC daughter) 
across all of Biblical Hebrew, independent of verb root. We include all CICs in all clauses. We 
visualize the data using tables and/or shaded bar charts. The tables characterize each CIC subtype 
by: 


* CIC incidence (labeled “Overall Token Count” in the tables)
* CICs realized by noun phrases
  * Percentage that are noun phrases (labeled “Noun Phrase / Token %”)
  * Number of semantic classes found (labeled “Noun Phrase / Semantic Classes”)
* CICs realized by prepositional phrases
  * Percentage that are prepositional phrases (labeled “Prepositional Phrase / Token %”)
  * Number of preposition lexemes (labeled “Prepositional Phrase / Preposition Lexemes”)
  * Number of distinct semantic classes (labeled “Prepositional Phrase / Semantic Classes”)
* CICs realized by other types of constructions (labeled “Other / Token %”)


The shaded bar charts graph the tallies of the various realizations of CICs. 


              | Overall | Noun Phrase        | Prepositional Phrase             | Other
CIC Subtype:  | Token   |         | Semantic |         | Preposition | Semantic |
All inclusive | Count   | Token % | Classes  | Token % | Lexemes     | Classes  | Token %
All CICs      | 241,553 |         | 32       |         | 12          | 39       |


This tells us that the 241,553 CICs found in Biblical Hebrew consist of slightly over one-quarter 
noun phrases, one-quarter prepositional phrases, and about half other entities. The NPs occupy 32 
semantic classes. The PPs involve 12 different prepositions (1 of them being a miscellany class). 
The substantives in prepositional phrases divide among 39 semantic classes. The “other” category 
consists of various kinds of particles plus clauses and sentences. Throughout this chapter, shaded 
cells contain the dominant row percentage. 


16.1 The Impermanent CIC Subtype 


In §9.3.1, we identify 3 sorts of “impermanent” CICs. The first sort (“underspecified ‘and’ cue
phrases”) is highly homogeneous in that its members are always and only realized as clause-initial
coordinating conjunctions. They constitute a transitory CIC category, since they will eventually be
promoted into discourse level. The other 2 impermanent sorts (lapsus calami and nebulous) are
each very heterogeneous, the former because we do not know how they function and the latter
because the great majority represent gaps in the texts filled by Qere readings.


Overall Prepositional Phrase 


CIC Subtype: Token Semantic Preposition | Semantic 
Impermanent Count | Token % | Classes | Token % | Lexemes | Classes | Token %
‘And’ cue phrases | 25,812 | 100%
Lapsus calami 
Nebulous 


One reads the table for the structures of the impermanent CICs as follows: the 25,812
“underspecified ‘and’ cue phrases” are neither NPs nor PPs but are clause-initial coordinating conjunctions (and
therefore appear in the table only under “other”). The nebulous CICs, excluding 4 “other”
realizations (11%),¹ have 60% realized by 21 NPs assigned to 9 different semantic classes. The remaining
29% are realized by 10 PPs involving 5 different prepositions and 9 different semantic classes. The
24² lapsii calami CICs are indeterminate, hence the “?” entries. Three tokens are garbled words:
Gen 30:11; Ps 10:10, 55:16. The other 21 tokens are Qere wĕlōʾ Kethivs.


16.2 The Syntactic Isolate CIC Subtype 


In §9.3.1, we identified 3 sorts of syntactic isolates. These are tallied in the table and
commented on in the subsequent paragraphs, each of which comments on one row of the table.


CIC Subtype: Semantic Preposition | Semantic 
Vocative 


589 
Exclamative 44 
Label 


Vocative Specifics. 45% of the vocatives have the semantic category human, 36% have the 
semantic category deity, 7% have the semantic category geographic (when a city is personified), 
and the remaining 12% are scattered over 19 semantic classes, for a total of 22 semantic classes 
for the noun phrases. Three instances involve “all of you,” each referring to the semantic category 
human. These are found in Judg 20:7, Isa 48:14, and 50:11. 

Exclamative Specifics. 63% (217/344) of the exclamative CICs are single segments classified
as exclamatives (“other”). Another 19% are noun phrases with the semantic category medical, 13%
are noun phrases with the semantic category quantity, and 2% are noun phrases with the semantic
category mental, plus 4 additional semantic classes. The single PP is in Gen 30:13 (“in my
happiness”). Since it is presently a unique realization of an exclamative, one might well examine its
context to see if an alternate CIC assignment might be more appropriate.


1. These may be found in 2 Sam 14:26, 19:7; 2 Kgs 9:25; and Isa 51:19.

2. In §3.2.1.1, we give the locations of 32 lapsii calami; here we tally only 24. Why this discrepancy of 8 instances?
There are 8 places where a lapsus calami is not the daughter of a laps CIC. Six are located within phrases: 1 Sam 4:13,
2 Sam 8:3, 1 Kgs 12:33, 2 Kgs 4:7, Job 26:12, and Lam 4:3. Two are CIC daughters, but in the version of our database
here used, we failed to assign them laps tags: Deut 33:2 (sbj) and 2 Chr 34:6 (loc).


Label Specifics. 71% are vocable noun phrases (נאם־יהוה ‘oracle of the Lord’); 12% are
nebulous noun phrases (סלה ‘Selah’); and 11% (72/683) are PPs with ל ‘to’ plus a noun with human
(93%), place, or other semantics (such as מזמור לדוד ‘psalm of/to David’).


16.3 The Predicator CIC Subtype 


Predicators are realized by various predicator subtypes (the rows in the table). The column
entries rank common verb forms (binyan + voice) from most-frequent (“Rank one”) to fourth-most
frequent (“Rank four”). The percentages are percentages of the associated “Token Counts.”


Rank one Rank two Rank three | Rank four 
Predicator Subtype Count Form Form Form Form 
Infinitive absolute 195 |QA* 66% |HA 18% |PA 9% |NP 5% 


HA 14% |PA 11% |NP 2% 
HA 28% |PA 6% |NP 3% 
HA 11% | NP 8% | QP 7% 
compound 23% 
HA 14% |PA 10% | NP 6% 


Infinitive construct 6,722 
Noun-verb/noun participle 109 
Noun-verb participle 2,336 
Verbal participle 2,787 
Unanchored finite verb 35,447 
Anchored finite verb 20,735 
Quasiverbal” 1,255 


a. Binyan: Q = Qal; P = Piel; H = Hiphil; N = Niphal. Voice: A = active; P = passive. 
b. The quasiverbals are the subject of chap. 18. 


16.4 The Operator CIC Subtype 


The 6 CIC types that we define to be operators were described in §9.3.4. Since they are realized
neither by noun phrases nor by prepositional phrases, we simply list their characteristics.


* Negation operator (5,954x). 85% are לא ‘not’, 12% are אל ‘do not’, and 3% are בל
‘[negation]’.

* Closed interrogative (876x). 87% are ה־ ‘yes/no?’, and 13% are אם ‘[question]’.

* Clausal גם ‘also’ (326x). Always the segment that provides the name of this subtype.
Most of the גם ‘also’ segments (58%) are supra-clausal.

* Includer (84x). These are all clausal אף ‘also’.

* Restricter (29x). These are רק ‘only’ and many instances of אך.

* Modal (265x). The modals are listed in §3.2.2.1, with the idiom type discussed in
§15.3.4.4.
16.5 The Grammatical Function CIC Subtype


16.5.1 Subjects

At the start of §9.3.5.1, we pointed to examples of various kinds of subjects: the standard
subject (of which we find 33,572, with 310 being distributive),³ the Janus subject complement / subject
(of which we presently have declared 107), and the subject of address (of which we find 22),
yielding a grand total of 33,701. These are tallied and characterized below.


Overall Prepositional Phrase 


CIC Subtype: Token Semantic Preposition | Semantic 
Subjects Count | Token %| Classes | Token %| Lexemes | Classes | Token 96 


Uer — |s [ 99 | db | ose | s| 2 
5 | os 
Subject ofadiress | 2] 0|  —1| 9 


Subject Specifics. The main semantics labels for the NPs are 36% human, 10% deity, and 4% 
each for body part and geographic. In addition, 11% of the subjects are free pronouns and 4% are 
suffixed pronouns. 

Distributive Subject Specifics. 92% of the distributive subjects have human semantics. 

Janus sc/sb Specifics. 34% have human semantics, and 26% have abstract semantics. 

Subject-of-Address Specifics. All 22 of the subjects of address consist of single clauses. 


16.5.2 Objects 
The characteristics of the various sorts of objects are as shown: 
3. Suspended and resumed subjects are included.

Overall
Token 
Count 


CIC Subtype: 
Objects 
Direct object 


Semantic 
Classes 


Second object 
Object of address 
Indirect object 


Only 37% of the direct objects and only one-quarter of the second objects are prepositional
phrases. All of the objects of address have been parsed as clauses, sentences, or discourse structures
(are “other”). And essentially all of the indirect objects are realized as prepositional phrases.

Direct Object Specifics. The leading semantic labels for the noun phrases are 16% human, 7%
body part, and 6% vocable. In addition, 19% (5,395x) are suffixed pronouns. Free pronoun objects
are rare (0.4% [107x], all but 2 are demonstratives). With regard to the prepositional phrases, the
most common involve the object marker⁴ plus: a human noun (26% of occurrences), a suffixed
pronoun (19%), a place (9%), a body part (7%), or a vocable (6%).

Distributive-Object Specifics (18x). 21% of the distributive objects have human semantics,
14% are utensils, and 10% are numbers.

Second-Object Specifics. The makeup of the second objects is remarkably diverse. The most
frequent realization is the human NP with 13%. Following along behind are the place, utensil, and
substance NPs, each with around 4%. And so on, and so on.

Object-of-Address Specifics. 57% of the objects of address involve sentences, while 43% are
single clauses. Lost in the rounding off of these percentages are 27 instances (0.5%) that are
discourse units.

Indirect-Object Specifics. The indirect objects are overwhelmingly realized by prepositional
phrases. The preposition is ל in 58.4%, אל in 37.3%, and על in 3.2%. The remaining 1.1% are
scattered among 7 other prepositions.

Distributive-Indirect-Object Specifics (40x). All are realized as prepositional phrases. 65% are 
5} + human, and 30% are 5 + human. 


16.5.3 Complements 
The characteristics of the various sorts of complements are as shown: 


Overall Noun Phrase Prepositional Phrase 
CIC Subtype: Token 


Semantic Preposition | Semantic 
Complements Count | Token 9o | Classes Lexemes | Classes | Token 96 


Subject complement 7,053 87% 
Object complement 1,237 64% 


Cmpl. aspect. 


Cogn. cmpl. 


4. Of 10,705 direct objects realized by PPs, 83% involve the object marker, 12% ב ‘in’, and 3% ל ‘to’.

Subject Complement Specifics. Nearly 90% of subject complements are bare nominals (87%).
These are scattered across a large number of semantic classes. As for the prepositional-phrase
realizations (631x), the most common preposition is ל ‘to’ (86%).

Object Complement Specifics. Regarding substantives, the most common semantic label is
human, 20%. Next most frequent are noun-verb participial constructions, 10%. The dominant
prepositional lexeme is ל ‘to’, 92%.

Complement of Aspectualizing Verb Specifics. Almost all of the nominal phrases (91% of
them) involve a nominalized infinitive construct. The nominalized infinitives construct account for
17% of this class of CICs. By far the most common realization among the PPs (362x) involves ל
‘to’ + infinitive construct (98%).

Cognitive Complement Specifics. In 16 instances, a compound cognitive complement is
present. Phrase marker (16.1) from Gen 29:12 provides an instance of this phenomenon (see p. 224). A
non-compound complement clause may (81%) or may not (19%) have an associated cue phrase. In
(16.1), the cue phrases are homographs of כי, here glossed ‘that’.


16.6 The Semantic Role CIC Subtype 


We divide the semantic role CICs into the seven subgroups listed inside the back cover. Some 
of the subgroup assignments may arguably have been made differently. 


16.6.1 Other Participants 
The characteristics of the various sorts of “other participant" CICs follow PM (16.1): 
Overall Noun Phrase Prepositional Phrase
CIC Subtype: Token Semantic Preposition | Semantic
Other Participants | Count | Token % | Classes | Token % | Lexemes | Classes
Accompanier 1,668 1% 10 9 30 
10 | om |  - s| 18 
Beneficiary 2,097 1% 8 5 31 
Harmed one 261 1% 2 li 17 
Involved one 80 
With very few exceptions, “other participants" are realized by prepositional phrases. Across all 7 
“other participant" CIC types (table rows), only 4 instances of the “other” class occur. 
Accompanier Specifics. By far the most common preposition lexeme is some form of nv
‘with’, which appears in 1,501 (90%) of these CICs. With regard to the semantics of the nominals
in the prepositional phrases, by far the most frequent are the pronoun suffixes (58%) and nominals
having human semantics (26%). While it is unusual to see so many different semantic classes
associated with a CIC class (10 in all), they do seem bona fide.


Other, Token %: 0%, 0%, 0%, 0%, 0%
Agential Specifics. The most common agential expression translates as ‘in hand of <X>’. This
sort of construction accounts for 51% of the agential realizations. The runners-up involve some 
form of 1 ‘in’ plus either a suffix pronoun (16%) or a deity (8%). 

Alternate | Surrogate Specifics. These are all prepositional phrases, the vast majority (93%) 
involving nnn ‘underneath’; 53% involve pronoun suffixes, while 22% involve nominals having 
human semantics. 

Beneficiary Specifics. By far the most common preposition is ל ‘to’ (90%). The associated
nominal is a pronoun suffix (61%), has human semantics (16%), or has divine semantics (5%).

Harmed-One Specifics. The distribution of the prepositions is less concentrated than for the 
CICs examined above. The data are: 3 ‘in’ 38%, b9 ‘upon’ 41%, and 5 ‘to’ 10%. The harmed ones 
involve pronoun suffixes 43% of the time, humans 36% of the time, and deities 8% of the time. 

Involved-One Specifics. With one exception (Qoh 7:27), this CIC involves prepositional 
phrases. The preposition is pa ‘between’ in 87% of cases. The nominal is a pronoun suffix 45% and 
a human 36% of the time. The typical formula is ‘between X and between Y’. 

Possessor Specifics. The preposition is 5 ‘to’ in 97.4% of the cases. The nominal is a pronoun 
suffix 58% of the time, has human semantics 25% of the time, and divine semantics 4% of the time. 


16.6.2 Movement, Actual and Notional 
The characteristics of the movement CICs are as shown (with table following on p. 226). 
Movement Aim Specifics. This CIC type exhibits 126 prepositional phrase patterns. The
preposition is 58 54% and 5 27% of the time. The 4 most frequent realizations are: 5x + place, 12%; 5x
+ human, 10%; place nominal, 11%; 58 + pronoun suffix, 9%.
Overall
CIC Subtype: Token 
Movement SRs Count 


1538 | 15% 


Bo | 1 
so | 294 


Movement origin 0% 


Movement Bearing Specifics. 81% of the prepositions are 5. The associated nominals are: suf- 
fix pronoun, 42%; deity, 26%; human, 17% (nominals of place only occur in 3% of the PPs!). 

Movement Interval Specifics. 11% of these CICs involve a bare spatial noun, 9% a place 
noun. For the PPs, the preposition is some form of (n ‘from’ 91% of the time. The most popular 
associated nominal semantic labels are: place, 30%; human, 20%; spatial, 18%. 

Movement Origin Specifics. Although 5 different preposition lexemes are attested, 1 ‘from’ is 
the overwhelming favorite, 99%. The most popular associated nominal labels are: place, 25%; body 
part, 17%; pronoun suffix, 13%; human, 12%. 


16.6.3 Spatial 
The characteristics of the various sorts of spatial CICs are as shown. 
Overall
CIC Subtype: Token Semantic 
Spatial SRs Count Classes 
Area 


Separation 


Area Specifics. 48% of the area specifics are complex geographical specifications (as in Exod
23:31); 21% are echo noun phrases found only in Ezekiel:⁵ סביב סביב ‘all around’ (NiPs).

Length Specifics. This is usually realized as a nominal headed by סביב ‘around’.

Location Specifics. The noun phrases are fairly sparsely attested, accounting for 7% of the 
location CICs. Place nouns make up 4% of the overall total. The prepositional phrases occur in 
almost 150 patterns. The “other” category includes 1,024 adverbs of space (53 being locatives). 

Separation Specifics. A small, scattered class. The sole CIC assigned to “other” is in Ps 34:14. 


16.6.4 Time 
The characteristics of the various sorts of time CICs are as shown. 
5. See Ezek 37:2; 40:5, 16; 41:9, 15, 19; 43:12.
Overall Noun Phrase Prepositional Phrase Other
CIC Subtype: Token Semantic Preposition | Semantic
Temporal SRs Count | Token % | Classes | Token % | Lexemes | Classes | Token %
Time aim 5 18 0%
Time interval 8 T 21 14%
Time origin l 4 15 0%
Time point 3,708 15% 9 75% 9 29 10%


Time Aim (Goal) Specifics. Three prepositional-phrase shapes outnumber all of the others: 5 + 
time, 38%; Ty + time, 34%; Ty + infinitive construct, 17%. 

Time Interval Specifics. Nominals having temporal semantics constitute the largest group, 57% 
overall. The second most common pattern involves temporal adverbials, 14%. Among the prepo- 
sitional phrases, time nominals are the most common, being 38% of the nominals in prepositional 
phrases. There are 3 curious prepositional phrases involving the object marker: Exod 13:7, Deut 
9:25, and Ezek 16:22. The “other” entry (161x) consists almost entirely of the temporal adverbs. 

Time Origin Specifics. The most common pattern is some form of m + time nominal, 59%. 
Forms of Į) are the most common preposition, 81%. The lone noun phrase occurs in Isa 40:21. 

Time Point Specifics. The most common shape is forms of 2 + temporal noun phrase, 35%, 
followed by forms of 3 + infinitive construct, 19%. Next are the time noun phrases, at 13%, fol- 
lowed by temporal adverbs, at 10%. The most common preposition is 3, which accounts for 76% 
of the prepositions. 


16.6.5 Enriching Constituent 


The characteristics of the various sorts of enriching constituent CICs are as follows. 


Overall Prepositional Phrase Other 


CIC Subtype: Preposition| Semantic 
Enriching Constituent Lexemes | Classes |Token % 
Cost 5 17 0% 
Instrument 8 32 3% 
Manner 2,457 10% 8 
— me 


| Number count ^ | | Number count ^ | 100% 
Number of times 34% 
Reference 1,109 


Resource (source of supply)| 248 
Ruled-over one 45 0% 
Quantity / quantifier 50 68% 0 


Cost Specifics. The surprise here is that only 4 costs involve bare noun phrases (Num 18:21, 
31; Deut 15:18; Isa 5:23). The prime preposition is 1, 79% of the prepositions. The nominals in the 
prepositional phrases are scattered. The top few are: human, 19%; valuables, 18%; and substance,
13%. 

Instrument Specifics. The most common bare nominal semantics is substance, 2%. The most 
common preposition is 3, 83% of the prepositions. The most common nominal constituents of the 
prepositional phrases have semantics: utensil, 23%; body part, 14%; abstract, 11%; substance, 10%. 

Manner Specifics. 72% of these CICs are adverbial. The top performers are the various forms 
and congeners of 515/12 ‘thus’, 39%; TRN ‘very’, 8%; various forms of TIT ‘together’, 6%. 

Material/Composition Specifics. Two-thirds of these CICs are bare nominals having sub- 
stance semantics (66%). Vegetation nouns account for another 14%, and utensil nouns for an ad- 
ditional 10%. 

Number Count Specifics. This class of CICs is a place saver. The single instance is at Josh 5:2. 

Number of Times Specifics. The 223 CIC instances classified as “other” involve the temporal 
adverb Tiy ‘again’ and account for 62% of the instances. Bare temporal nominals account for 34%. 

Reference Specifics. This class is the “when-all-else-fails” category. Consequently, its seman- 
tic role is vague, and its makeup is quite disparate, because there are over 100 different patterns at- 
tested. The predominant referent pattern involves 5 ‘to’ + human, 21%. The runner-up is 5» ‘upon’ 
+ pronoun suffix, 12%. Five interrogative CICs have been assigned to “other” (0.4%). 

Resource (Source of Supply) Specifics. The 2 prepositions involved are forms of מן ‘from’
(94%) and forms of ב ‘in’ (6%). Such a construction is usually termed a partitive מן. The majority
of the nominals in the prepositional phrases have human semantics (56%); 8% are pronoun suffixes.

Ruled-over-One Specifics. At present, this class is incompletely marked up; 58% of its in- 
stances involve 5p ‘upon’ + human, and 18% involve op + pronoun suffix. 

Quantity / Quantifier Specifics. 66% of these are numbers. 


16.6.6 Phrasal Discourse Unit 
The characteristics of the various sorts of phrasal discourse unit (phrasal DU) CICs are as
follows.


Overall Noun Phrase Prepositional Phrase Other 
CIC Subtype: Token Semantic Preposition | Semantic
Phrasal DU Count | Token % | Classes | Token % | Lexemes | Classes | Token %
Comparison 2,682 1% 7 35 0% 
Cause 63 3% 5 13 0% 
Deprivation 99% 4 0% 
Quoter 1 0% 


Comparison Specifics. Naturally, the most common preposition is 3 ‘like’ and its congeners, 
89%. The top 4 semantic categories for the nominals found in the prepositional phrases are: nomi- 
nalized clause, 21%; human, 11%; vocable, 7%; creature, 7%. 

Cause Specifics. This CIC type is feebly realized at phrase level; it will loom much larger 
once we classify supra-clausal discourse units. The leading prepositional phrases are some form of
a + body part, 17%; forms of 3 + abstract noun, 13%; and 7/2 + mental, 10%. 
Deprivation Specifics. Deprivation is most often expressed by some form of 7 + human,
14%, or some form of 77) + number, 7%. 

Quoter Specifics. All of these are 5 + infinitive of utterance (7/8’9).


16.6.7 Mixed Level

The characteristics of the various sorts of mixed-level CICs are as follows: 


Overall Noun Phrase Prepositional Phrase 
CIC Subtype: Token Semantic Preposition| Semantic 
Mixed Level Count Lexemes 
Aim / purpose 


26 13%
9 0%


Reason 
Result 
Undesired outcome 


We have omitted 3 mixed-level “place saver” categories⁶ from this table:


• But rather: only so labeled in Deut 4:12 and 1 Kgs 12:20.
• Concessive: only so labeled in Ezek 32:30.
• Condition: only so labeled twice, in Gen 4:7 and Job 4:2.


Aim/ Purpose Specifics. The vast majority consists of 5 + infinitive construct, 93%. 

Reason Specifics. This highly diffuse type most often occurs as the prepositional phrase 7/ + 
body part, only 16%. The 106 instances of “other” are overwhelmingly interrogatives (101x). 

Result Specifics. Mostly prepositional phrases, this CIC type is most often realized by 5 + 
infinitive construct, 39%, and 5 + quality / quantity, 14%. 

Undesired Outcome Specifics. Almost entirely prepositional phrases, the 3 top contributors 
are [2 + infinitive construct, 74%, 5 + infinitive construct, 6%, and 7 + human, 6%. 


16.7 Brief Summary 


Procedures. Across all of the clauses of Biblical Hebrew, we have examined CIC subtype 
makeup in terms of whether and when the CICs are realized by noun phrases, by prepositional 
phrases, or by other entities. The presentation consists of: (1) table entries for CICs that are primar- 
ily of one sort and/or (2) bar charts when there is significant variety. 

Impermanent, Syntactic Isolate, Predicator, and Operator CICs. The impermanent subtype 
provides little of interest, because it is a "ragbag." With regard to the syntactic isolates, vocatives 
are almost always realized by NPs (100%), about half of them having human semantics and about
one-third having divine semantics. The exclamatives are realized by exclamative POSs 63% of the
time and otherwise by NPs involving 8 different semantic classes 37% of the time. Labels are realized
by NPs 89% of the time. Predicators are most frequently realized by Qal actives. With regard
to the operators, each kind is realized by lexemes of the relevant POS. For example, the 5,943 
closed interrogatives are 89% - ‘yes/no?’ and 11% DX ‘[question]’. 


6. That is, we have not yet assigned these categories systematically. 
Grammatical Function CICs. Subjects are overwhelmingly NPs (99%); direct objects are
62% NPs and 37% PPs; objects of address are 100% clauses, sentences, or discourse structures;
and indirect objects are essentially 100% PPs. Complements, a mélange that we smuggle in among
the grammatical functions because of their importance for the היה-corpus and verbless clauses,
consist of subject complements (89% NPs and 9% PPs), object complements (64% NPs and 35%
PPs), complements of aspectualizing verbs (17% NPs and 82% PPs), and cognitive complements
(100% clauses, sentences, or discourse structures).

Semantic Roles. With regard to Other Participant SRs, essentially 100% are PPs. With regard 
to Movement SRs, movement aim is mixed (15% NPs and 84% PPs) as is movement interval (29% 
NPs and 70% PPs), bearing is mixed (12% NPs and 88% PPs), while origin is essentially 100% 
PPs. With regard to Spatial SRs, location is mixed (7% NPs, 85% PPs, and 7% adverbs of space). 
The location SR (14,557x) dwarfs the other spatial SRs: area (33x), length (189x), and separation 
(23x). With regard to the Time SRs, time aim is 99% PPs, time interval is significantly mixed (58%
NPs, 28% PPs, and 14% temporal adverbs), time origin is 100% PPs, and time point is mixed (15%
NPs, 75% PPs, and 10% temporal adverbs). With regard to Enriching Constituent SRs, material,
number count, and quantity are basically NPs; cost, instrument, reference, resource, and ruled- 
over one are basically PPs; manner SRs are usually adverbs (74%), as are number of times (62%). 
With regard to Phrasal Discourse Units, all (comparison, cause, deprivation, and quoter) are es- 
sentially 100% PPs. With regard to Mixed-Level SRs, all (aim / purpose, reason, result, undesired 
outcome) are around 90% PPs. 


Chapter 17 


Computing the Distances 
between Verb Corpora 


In this chapter, we introduce methods for computing and representing the distances between 
verb corpora. Knowing these distances allows us to cluster the verb corpora into natural affinity 
groups. This, in turn, permits us to build up a powerful generalization storehouse, the hierarchical 
lexicon. While the ideas underlying all this are not simple, their grasp should reward persevering
readers.


17.1 The Hierarchical Lexicon, Verb Classes, and Inter-Clause Distance


17.1.1 The Hierarchical Lexicon 

Grammarians are ever on the prowl for generalizations about the languages that they study. The 
hierarchical lexicon is a construct that is designed to make certain language generalizations explicit. 
In a traditional lexicon, each entry contains the particular information that pertains to the lexeme 
being described. There is much redundancy, wasted space. In a hierarchical lexicon, each lexeme 
is positioned in a hierarchical structure. It inherits many of its characteristics from its “ancestors.” 

When viewing the lexicon from the syntactic perspective, one works in terms of syntactic cat- 
egories. One may then observe that the characteristics of Nouns are inherited by CommonNouns 
(in other words, CommonNouns are daughters of Nouns). Similarly, ProperNouns inherit properties 
from Nouns of which they also are daughters. Getting even more specific, the lexeme Book inherits 
characteristics from the CommonNouns, so we may write: 


Substantive > Noun > CommonNoun > Book 
The > symbol can be read “contains.” 
When viewing the lexicon from the semantic perspective, one uses a hierarchy of semantic
categories. One much-used hierarchy¹ allows us to assert that:
Entity > Physical > Object > SelfConnectedObject > CorpuscularObject > Artifact > Text > Book
In practice, matters are more complicated, as Forbes has discussed.² The main point of all this, for
our purposes, is that the formation of a hierarchy involves the making of useful generalizations. 
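
The inheritance idea can be made concrete with a small Python sketch. The parent table encodes the syntactic chain given above (Substantive > Noun > CommonNoun > Book). The property names and values (`can_head_phrase`, `has_number`, `takes_article`, `gloss`) are invented purely for illustration and are not drawn from the authors' lexicon.

```python
# Each category names its parent; the chain follows the text:
# Substantive > Noun > CommonNoun > Book
PARENT = {
    "Noun": "Substantive",
    "CommonNoun": "Noun",
    "ProperNoun": "Noun",
    "Book": "CommonNoun",
}

# Properties are stated once, at the most general level where they hold.
# All property names and values here are hypothetical.
LOCAL_PROPERTIES = {
    "Substantive": {"can_head_phrase": True},
    "Noun": {"has_number": True},
    "CommonNoun": {"takes_article": True},
    "ProperNoun": {"takes_article": False},
    "Book": {"gloss": "book"},
}


def ancestors(category):
    """Return the chain from the category up to the root."""
    chain = [category]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def properties(category):
    """Collect inherited properties; nearer levels override remoter ones."""
    merged = {}
    for level in reversed(ancestors(category)):
        merged.update(LOCAL_PROPERTIES.get(level, {}))
    return merged


book = properties("Book")
```

The entry for Book stores only its gloss; everything else is supplied by its ancestors, which is the redundancy saving that the hierarchical lexicon is meant to deliver.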


1. Ian Niles and Adam Pease, “Towards a Standard Upper Ontology,” Proc. 2nd Int. Conf. on Formal Ontology in 
Info. Sys. (FOIS-2001), www.ontologyportal.org. The building of taxonomies and their very useful extensions, ontolo- 
gies, is notoriously labor intensive. 

2. A. D. Forbes, “How Syntactic Formalisms Can Advance the Lexicographer’s Art,” in Foundations for Syriac 
Lexicography III (ed. Janet Dyk and Wido van Peursen; Piscataway, NJ: Gorgias, 2009) 150-55. 
17.1.2 Verb Classes

Linguists find it useful to think in terms of “verb classes.” Once verb classes are created, one 
can seek generalizations class by class rather than verb by verb. Beth Levin has classified the
English verbs, producing a four-level hierarchy: Verb > VerbClass [49 in all] > SubClass >
SubSubClass.³ The subsubclasses become awesomely specific. Example:


Verbs > VerbsOfExistence > VerbsOfGroupExistence > SwarmVerbs > 
{abound, bustle, crawl, creep, hop, run, swarm, swim, teem, throng}⁴


17.1.3 Inter-Clause Distance 


17.1.3.1 The Distances between Sets of Clauses 

We seek to group verb corpora, based (for starters) on clause immediate constituent incidence.
Consider the pair of “stripped” CIC incidence plots⁵ for the sets of clauses involving אמר and היה
shown below. Clearly, the two sets of clauses characterized are quite different. For example, in the
seventh bar position (direct-object incidence / object-of-address incidence), the two bars in the two
charts are very different in height. While אמר clauses almost always have a direct object (actually,
object of address), the היה clauses have none.
3. Beth Levin, English Verb Classes and Alternations: A Preliminary Investigation (Chicago: University of
Chicago Press, 1993).

4. Ibid., 253. 

5. By "stripped," we indicate that bar labels have been removed. 
We ask: “With regard to CIC incidence, what is the specific distance between the אמר corpus and
the היה corpus?”

Or consider these CIC-incidence plots for the sets of clauses involving בא and עלה. Clearly,
these two sets of clauses are quite similar. But how far apart are the corpora that they describe? 
If we call the unit of inter-clause distance the lambdin (or lam), in memory of the Harvard
professor, then the אמר and היה sets of clauses are 17.42 lams apart, while the בא and עלה clauses are 1.72
lams apart, 10 times closer together. The latter pair is the closest together of the Qal active verbs
occurring more than 60 times. Next, we informally sketch the way that we obtained these distances.
17.1.3.2 The Estimation of Inter-Clause Distances
Estimation of inter-clause distances involves analytical subtlety. In particular, one must: 
• Specify the sample space (our data lie in a 23-simplex),
• Transform the data (via the closure operation),
• Remove the sampling zeros (via multiplicative replacement),
• Select the proper distance measure (the Aitchison metric).


If this sounds fairly technical, this is because it is. The details (with references) are given in appen- 
dix 5. But we can provide a sense of what is involved in layperson's prose: 
One decides which incidences to use. The resulting CIC incidence data require special han- 
dling for three reasons: (1) When the predicator incidences are expressed as fractions, they 
add to unity. (2) The incidences all lie between zero and one. (3) A few of the incidences may
happen to be precisely zero. Each peculiarity must be addressed as one devises the distance- 
estimation process. The first issue is addressed by selecting an appropriate space in which 
to situate the data points, the second by appropriately normalizing the data, and the third by 
adjusting the data so that any zero values are replaced by appropriate tiny values with the 
other values suitably compensated. Once all this is done, the statistical literature supplies an 
appropriate formula for the distances sought. 


CIC Incidences Used. Any book on pattern recognition devotes considerable space to the 
problem of feature selection. This is because, when one is building a pattern recognizer, the use of 
too many features can lead to very brittle systems that do not classify never-before-seen data ac- 
curately. Because we are not attempting pattern recognition but, rather, are merely seeking to infer 
the affinities exhibited within the data at hand, we can afford to be rather blasé in regard to feature 
selection. Consequently, we elect simply to use as our features a subset of the incidences associated 
with the most frequently attested CICs. We use the incidences of all of the CICs that occur more 
than 1,500 times in Biblical Hebrew, 23 CICs in all. Included are the 17 CICs listed in the CIC
incidence plot for היה in §13.5 plus aim, instrument, closed interrogative, manner, accompanier,
and vocative.

CIC Incidence Adjustments. By constraining the fractional predicator incidences associated 
with each verb corpus to sum to unity, we make them consistent with the required sample space, 
a simplex.⁶ The distance metric suited to such a space (the so-called Aitchison metric) involves
the logarithms of each of the CIC incidences. If an incidence is zero, then the distance involves an
infinite value. We, therefore, must adjust the zero entries appropriately in order to avoid
infinities of this sort. Precisely how this is done is explained in §A5.2.4.
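
The chain of operations just described (closure, zero replacement, Aitchison distance) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the distance is computed in its common centered log-ratio form, the replacement value `delta` is an arbitrary choice, and the incidence counts are invented. The authors' own specifics are those of appendix 5, which this sketch does not claim to reproduce.

```python
import math


def closure(counts):
    """Rescale non-negative values so that they sum to one."""
    total = sum(counts)
    return [c / total for c in counts]


def replace_zeros(parts, delta=1e-3):
    """Multiplicative replacement: zeros become delta, the rest shrink so the sum stays one.

    The value of delta is an assumption made for this sketch.
    """
    n_zero = sum(1 for p in parts if p == 0)
    shrink = 1.0 - n_zero * delta
    return [delta if p == 0 else p * shrink for p in parts]


def clr(parts):
    """Centered log-ratio: log of each part minus the mean of the logs."""
    logs = [math.log(p) for p in parts]
    mean_log = sum(logs) / len(logs)
    return [v - mean_log for v in logs]


def aitchison_distance(x, y):
    """Euclidean distance between the centered log-ratio transforms."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(clr(x), clr(y))))


# Invented incidence counts for two verb corpora over four CICs.
corpus_a = replace_zeros(closure([120, 30, 0, 50]))
corpus_b = replace_zeros(closure([60, 15, 5, 20]))
distance = aitchison_distance(corpus_a, corpus_b)
```

Without `replace_zeros`, the zero in the first corpus would send `math.log` to an error, which is the computational face of the "infinite value" mentioned above.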


17.1.4 A Simple Example Involving Four Clause Sets 

In chaps. 12-15, we presented the CIC-incidence bar charts for clauses built around four very 
frequently attested Qal active verbs: אמר ‘say’, היה ‘be’, עשה ‘make’, and נתן ‘give’. These proved
to be quite disparate in their behaviors. Proceeding along the lines just outlined (executing the spe- 
cifics given in appendix 5), we estimate the inter-clausal distances for the four sets of clauses as 
follows: 


Aitch 
| nn 
nwy 


17.0 
9.6 


We see that אמר and היה are the farthest apart (17 lams), while עשה and נתן are the closest together
(5 lams). We note also that עשה and נתן are nearly equidistant from אמר (~9.5 lams) and also from
היה (~13 lams). Given these few sets of clauses, we might use the six distances to produce a two-


6. A three-simplex is better known as an equilateral triangle. A four-simplex is better known as a tetrahedron or 
pyramid. 
7. The abbreviation “Aitch” is a reminder that the table holds distances computed using the Aitchison metric. 
dimensional map having the verb corpora as its “cities.” The distances, however, only determine
the relative positions of the corpora and not the orientation of the map. This, of course, is true of 
the mileage chart on a map. From an appropriate mileage chart, we find that Paris is 342 kilometers 
from London, but we cannot discover that Paris is southeast of London. 


17.2 Inter-Clause Clustering 


17.2.1 Options for Inter-Clause Representation 


Just how the four sets of clauses are positioned with respect to each other is not immediately 
obvious from the table of distances, but there are well-understood ways of producing displays 
showing how objects are positioned relative to each other on the basis of tables portraying the dis- 
tances between them. There are two basic approaches, geometrical and clustering.⁸


17.2.2 The Geometrical Approach to Representation 

In the geometrical approach, one attempts to project data from a high-dimensional space onto 
a lower-dimensional space and then leave it to observers to infer which projected objects “go to- 
gether" to form clusters that have a great deal in common. Ideally, one would project the data onto 
a two-dimensional space (a plane). We have done this sort of thing in the context of the analysis of 
Hebrew spelling practices.⁹ As usual, there is a possible hitch: if the data being represented “fully
inhabit” the high-dimensional space,¹⁰ then projecting them onto a low-dimensional space will
distort the distance relations, perhaps catastrophically. There are ways of assessing the adequacy
of a given projection, but they are too technical to be included here.¹¹ For our situation, suffice it
to assert that projections onto lower-dimensional spaces are too distortive, making the geometrical
approach less than appealing.¹²


17.2.3 The Clustering Approach to Representation 

The inference of the verb class clusters remains to be considered. Clustering has long been a 
staple of taxonomists. Attention to it accelerated after Sokal and Sneath published Principles of 
Numerical Taxonomy in 1963.¹³ In recent years, interest has been intensified because of its growing
usefulness in “data mining” and “bioinformatics.”


8. For an accessible discussion of these options in the context of biblical studies, see chap. 8 (“Choice of
Statistical Methods”) of D. N. Freedman, A. D. Forbes, and F. I. Andersen, Studies in Hebrew and Aramaic Orthography
(Biblical and Judaic Studies from UCSD 2; Winona Lake, IN: Eisenbrauns, 1992) 93-110.

9. Ibid., 104-10. 

10. As an example, consider two classes of data—the members of one class being constrained to lie within a 
sphere and the members of the other lying within a spherical shell concentric to the sphere holding the members of the 
first class. Projecting this three-dimensional configuration onto two dimensions will overlap the two clusters badly. In 
three-dimensional space, the separation of the classes is trivial. If a point is within the sphere, it is a member of the first 
class; otherwise, it is a member of the second class. No such (perfect) decision rule can hold for the two-dimensional 
projection. 

11. Technical note. They assess the fraction of the total variance accounted for by various projections and involve 
ratios of linear combinations of the squares (or absolute values) of the eigenvalues of the principal components. See for 
example, Brian Everitt, An R and S-Plus® Companion to Multivariate Analysis (Berlin: Springer, 2005) 46—47. 

12. Technical note. Put more concisely and technically, our eigenvalues asymptote to zero too slowly. For the 
movement verbs discussed in §17.3.2, the two-dimensional projection includes only 55% of the total variance, the three- 
dimensional projection only 68%, and the four-dimensional projection only 77%. 

13. R. Sokal and P. Sneath, Principles of Numerical Taxonomy (San Francisco: Freeman, 1963). 
Of the several approaches to clustering, we prefer agglomerative (or “bottom-up”) clustering.
In this variant, one initially considers each object being classified to be a (singleton) cluster. One 
then proceeds as follows: 


• Step A. Based on the estimated distances between the various clusters, one combines the
two closest clusters (initially, two single objects) to form a new cluster.

• Step B. Then, using one of several available rules,¹⁴ one computes the distances from
the just-formed cluster to all of the other clusters.

• Step C. One returns to Step A until there is only one cluster, at which point the clustering
has been completed.
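
Steps A to C can be sketched in Python as follows. The sketch uses the "average" (UPGMA) rule and, as input, the rounded distances quoted in the prose of §17.1.4 (17, 5, about 9.5, and about 13 lams), keyed by English gloss. These are approximations taken from the running text, not the exact table entries, and the function names are illustrative only.

```python
from itertools import combinations

# Rounded distances (in lams) quoted in the prose of section 17.1.4,
# keyed by English gloss; the exact table values are not reproduced.
DIST = {
    frozenset(["say", "be"]): 17.0,
    frozenset(["say", "make"]): 9.5,
    frozenset(["say", "give"]): 9.5,
    frozenset(["be", "make"]): 13.0,
    frozenset(["be", "give"]): 13.0,
    frozenset(["make", "give"]): 5.0,
}


def average_link(c1, c2):
    """Average of all object-to-object distances between two clusters."""
    pairs = [DIST[frozenset([a, b])] for a in c1 for b in c2]
    return sum(pairs) / len(pairs)


def agglomerate(objects):
    """Return the merges as (cluster, cluster, height), lowest first."""
    clusters = [frozenset([o]) for o in objects]
    merges = []
    while len(clusters) > 1:
        # Step A: find and combine the two closest clusters.
        c1, c2 = min(combinations(clusters, 2), key=lambda p: average_link(*p))
        merges.append((c1, c2, average_link(c1, c2)))
        # Step B is implicit: average_link recomputes distances to the new cluster.
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
        # Step C: the loop repeats until a single cluster remains.
    return merges


merges = agglomerate(["say", "be", "make", "give"])
for left, right, height in merges:
    print(sorted(left), sorted(right), round(height, 2))
```

With these rounded inputs the final merge falls at 43/3, or about 14.3 lams, which agrees with the "bit over 14 lams" read from the dendrogram for the four corpora.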


The outcomes of this process are represented by a tree structure, or a “dendrogram.” Here is the 
dendrogram for our four verb corpora. 


[Dendrogram for the four verb corpora; the vertical scale at the left is in lams.]

The tree makes explicit the similarities among its four corpora. The היה corpus is very different
from the others, since it is a bit over 14 lams from the cluster consisting of the other three corpora.¹⁵
The verbs עשה and נתן are two to three times closer to each other than to the others, being separated
from each other by 5 lams.


14. Among the options are: (1) the “complete link" (or “furthest neighbor") distance, wherein the distance from 
one cluster to some other cluster is the distance between the two furthest neighbors out of each of the clusters; (2) the 
"single link" (or "nearest neighbor") distance, wherein the distance from one cluster to some other cluster is the dis- 
tance between the two nearest neighbors out of each of the clusters; and (3) the “average” (or “unweighted pair-group 
method using arithmetic averages” = UPGMA) distance, wherein the distance from one cluster to some other cluster is 
the average of all distances between pairs of objects out of each cluster. On (1) and (2), see D. J. Hand, Discrimination 
and Classification (Chichester: Wiley, 1981) 164—69. On (3), see H. C. Romesburg, Cluster Analysis for Researchers 
(Belmont, CA: Lifetime Learning, 1984) 15-23. We use the "average" distance. 

15. The distance between clusters is the level of the horizontal line joining them as read from the scale on the left. 
17.2.4 A Measure of Dendrogram Adequacy
One more task is required. We need to indicate how well the dendrogram shown reflects the 
facts contained in the distance chart upon which it is based. There is a much-used statistic designed 
to do just this, the cophenetic correlation coefficient (CCC).¹⁶ Its name makes it sound more arcane
than it actually is. Its computation involves three simple steps:
1. Recover the distances among the clustered objects as encoded in the dendrogram and form
a distance table (the “cophenetic matrix”). This is done by “walking” from one object node
to another, with the largest distance encountered along the way recorded as the distance.
2. Lay out the original distances (the “Aitchison distances”) and the distances represented in
the dendrogram as two parallel lists.
3. Compute the correlation between the two lists, the cophenetic correlation coefficient (CCC).


The closer the CCC is to 1.00, the greater the fidelity of the dendrogram to the original distances. 
The usual threshold of acceptability is CCC > 0.80.¹⁷
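
The three steps can be followed in a short Python sketch for the four-corpus example. Two assumptions are made: the original distances are the rounded values quoted in the prose of §17.1.4, and the merge heights are those that average-link clustering yields from them (5, 9.5, and 43/3 lams) rather than values read off the printed figure.

```python
import math
from itertools import combinations

LABELS = ["say", "be", "make", "give"]

# Rounded distances (in lams) taken from the prose of section 17.1.4.
ORIGINAL = {
    ("say", "be"): 17.0, ("say", "make"): 9.5, ("say", "give"): 9.5,
    ("be", "make"): 13.0, ("be", "give"): 13.0, ("make", "give"): 5.0,
}

# Merges of an average-link dendrogram built from those distances:
# (members of the merged cluster, merge height). Assumed, not read off the figure.
MERGES = [
    ({"make", "give"}, 5.0),
    ({"say", "make", "give"}, 9.5),
    ({"say", "be", "make", "give"}, 43.0 / 3.0),
]


def cophenetic(a, b):
    """Step 1: height of the lowest merge that contains both objects."""
    return min(h for members, h in MERGES if a in members and b in members)


def pearson(xs, ys):
    """Step 3: ordinary product-moment correlation of two lists."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Step 2: two parallel lists over the same ordered pairs.
pairs = list(combinations(LABELS, 2))
original = [ORIGINAL[p] for p in pairs]
from_tree = [cophenetic(a, b) for a, b in pairs]
ccc = pearson(original, from_tree)
```

For these illustrative inputs the coefficient comes out at roughly 0.93, comfortably above the 0.80 threshold; the only distortion is that the tree reports one common distance from ‘be’ to each of the other three corpora.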


17.3 Affinities for Two Large Collections of Verb Corpora 


17.3.1 Clauses Involving the Twenty Most Frequent Qal Active Verbs 

Late in chap. 11, we tabulated the 20 most-frequent verb roots, including all of their binyanim, 
along with their incidence counts. Here are the 20 most-frequent Qal active verb roots along with 
their incidence counts:¹⁸


Count 
nop! 590 

| n| 550 
545 

521 


Applying the methods sketched above in this chapter, we obtain the dendrogram for verb corpora 
containing the 20 most-common Qal active roots shown on p. 239. What does this dendrogram 
tell us at first glance? 


• The fairly low value of the CCC (0.84) tells us that there are relations among the verb
corpora that are not fully represented by the cluster diagram (the clustering is more
distortive than we could wish).

• The fact that היה ‘be’ and מת ‘die’ are located very high in the diagram tells us that these
2 roots are quite atypical among these frequent verbs in the way that they make use of the
CIC repertoire.

• Of the 20 verb corpora analyzed, the בא corpus and the עלה corpus are most similar,
joining as they do at the smallest distance apart (~1.7 lams).


16. The best treatment of which we are aware is Romesburg, Cluster Analysis, 24–27. For one recent example of
its use, see E. E. Kuramae et al., “Cophenetic correlation analysis as a strategy to select phylogenetically informative
proteins: An example from the fungal kingdom,” Bio-Med Central Evolutionary Biology 7 (2007) 134–44.

17. Romesburg, Cluster Analysis, 27. 

18. Note that 727 and Dp are no longer present, while 83) and NU1 now are. 
[Dendrogram: the 20 most frequent Qal active verb corpora. Cophenetic Correlation Coefficient = 0.84]

There is more to be said, but first we must extract the clusters.


17.3.1.1 Isolating Clusters 

We need to explain how the dendrogram allows one to define verb corpora clusters. Since one 
does not know ab initio how many clusters there are, one usually tries out several sets to see if any 
groupings of the objects seem superior to others. This is done by drawing horizontal lines at various 
levels across the dendrogram, all edges having been notionally extended to the zero level by pulling 
each leaf down to the zero level. Each sub-tree below the cut-level is then a cluster. 

The classification tree below (p. 240) should help make this clear. Each corpus-identifying Hebrew
root is a leaf on the classification tree. Before isolating clusters, we (notionally) drag each corpus
label down to the zero level of the scale, lengthening the branch connecting to it. If a horizontal
threshold drawn across the diagram intersects a branch connecting to a leaf or cluster of leaves,
then the dangling cluster (or leaf) is a cluster as defined by that threshold level. At the far right of
the classification tree is a pair of corpora, אמר and ידע. When the cut threshold is set at 8 lams (dotted
line), each of the 2 corpora occupies its own (singleton) cluster. But when the cut threshold is
set to 10 lams (dashed line), then the 2 corpora are parts of the same cluster. For this dendrogram:


* A cut-level at 10 lams (dashed line) delimits 4 verb-corpus clusters. Two are strongly
idiosyncratic corpora. These corpora do not fit in anywhere, at least as far as the present set
of 20 verb corpora is concerned.

  * Cluster 1 (היה ‘be’)
  * Cluster 2 (מות ‘die’)
  * Cluster 3 consists of אמר ‘say’ and ידע ‘know’.
  * Cluster 4 contains 16 verb corpora. It divides into 2 major clusters if the cut-level
    is reduced to 8 lams (dotted line):

    * a 6-corpus cluster containing a 5-corpus “movement cluster,” labeled “M”
    * a 10-corpus cluster containing a 6-corpus “transitive cluster,” labeled “T”
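
The effect of moving the cut-level can be illustrated with a short Python sketch. It assumes a dendrogram encoded as nested `(height, left, right)` tuples; the eight leaf labels and the join heights are hypothetical and are not the values of the dendrogram discussed here.

```python
def leaves(node):
    """Leaf labels under a node; a node is a label or (height, left, right)."""
    if isinstance(node, str):
        return [node]
    _height, left, right = node
    return leaves(left) + leaves(right)


def cut(node, level):
    """Clusters defined by a horizontal cut across the dendrogram at `level`."""
    # A leaf, or a sub-tree whose top join lies at or below the cut, hangs from
    # a single branch that crosses the threshold, so it forms one cluster.
    if isinstance(node, str) or node[0] <= level:
        return [leaves(node)]
    _height, left, right = node
    return cut(left, level) + cut(right, level)


# Hypothetical tree: two outliers (A, B), a pair (C, D), and a larger group.
toy_tree = (
    14.0, "A",
    (12.0, "B",
     (11.0,
      (9.0, "C", "D"),
      (9.5, (6.0, "E", "F"), (7.0, "G", "H")))),
)

print(cut(toy_tree, 10))  # C and D share a cluster
print(cut(toy_tree, 8))   # C and D are singletons; the larger group splits in two
```

As in the discussion above, a pair that joins between the two thresholds (here C and D, joining at 9.0) forms one cluster under the higher cut and two singleton clusters under the lower one.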
17.3.1.2 The “Mindless” Clustering Algorithm

The principle invoked for considering these clause types together ("frequently attested") in 
no way ensures that they fall into natural subgroups. And, in the event, they do not cluster into 
neat, highly compact and well-isolated clusters. Rather, several of the verb corpora just noted are 
strongly idiosyncratic. Further, the fairly low value of the cophenetic correlation coefficient (0.84) 
indicates that the dendrogram has done only a fair job of accounting for the distances separating 
the “top 20.” 

This points to a characteristic of the clustering algorithm that can lead to false inferences if one 
ignores the results-check provided by the cophenetic correlation coefficient (CCC): no matter what 
distances are supplied to the clustering algorithm, a dendrogram will be built. 
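
A minimal sketch makes the point. It assumes average linkage purely for illustration (the linkage rule actually used for the dendrograms in this chapter is not restated here), and the function name, labels, and distances are hypothetical. Whatever distances are supplied, the loop runs until a single tree remains.

```python
from itertools import combinations


def average_linkage(labels, dist):
    """Agglomerate `labels` into one nested tree of (height, left, right) tuples.

    `dist` maps frozenset({a, b}) to the distance between items a and b.
    """
    # Each active cluster is a pair: (tree built so far, list of member labels).
    clusters = [(label, [label]) for label in labels]
    while len(clusters) > 1:
        best = None
        for i, j in combinations(range(len(clusters)), 2):
            members_i, members_j = clusters[i][1], clusters[j][1]
            total = sum(dist[frozenset((a, b))] for a in members_i for b in members_j)
            d = total / (len(members_i) * len(members_j))
            if best is None or d < best[0]:
                best = (d, i, j)
        d, i, j = best
        merged = ((d, clusters[i][0], clusters[j][0]), clusters[i][1] + clusters[j][1])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters[0][0]


# Two items that are very far apart are still joined into "a cluster".
far_apart = {frozenset(("elephant", "sardine")): 22.0}
print(average_linkage(["elephant", "sardine"], far_apart))
```

The algorithm has no notion of "too far to belong together"; it only records the height of each join. Judging whether a join is meaningful is left to the analyst, with the CCC and the join heights as the available checks.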

To make our point concretely, consider the sets of clauses containing one or the other of the 2
Qal active verbs (occurring more than 60 times) that are farthest from each other, טמא ‘be unclean’
(75x) and חפץ ‘desire’ (78x). The CIC incidence bar chart for טמא is shown at the top of p. 241.
The CIC incidence bar chart for חפץ is shown at the bottom of p. 241.

Across the 103 sets of clauses having Qal active verbs and at least 60 clauses each, the average
distance separating the corpora is 13 lams. The distance separating the טמא and חפץ clause types
is 22 lams, approaching double the average. Hence, the cluster of the 2 sets that would be formed
by supplying only their incidences to the clustering algorithm would lack compactness. Saying
that the 2 sets are in the same cluster would be akin to placing an elephant and a sardine, say, in the
same taxonomic cluster. While both are animals, on the surface they have little in common.


17.3.2 Clauses Involving Frequent Verbs of Movement 


17.3.2.1 Qal Active Verbs of Movement 

When we clustered the 20 most-common Qal active roots in §17.3.1, the affinities among the
verb corpora resulted by happenstance. In this subsection, we consider verb corpora involving 17
Qal active verbs (each occurring more than 60 times) that we independently marked as having
simple movement semantics. To the extent that our assignments were valid, we expect the sets to
form illuminating clusters.
Root | Gloss | Count || Root | Gloss | Count || Root | Gloss | Count
בוא | go in | 1,975 || עמד | stand | 435 || נוס | flee | 148
הלך | walk | 1,368 || נפל | fall | 356 || נסע | break camp | 136
 | | || ירד | go down | 289 || פנה | turn | 116
 | | || שכב | lie down | 191 || גור | sojourn | 76
 | | || עזב | leave | 182 || רוץ | run | 73
 | | || סור | turn aside | 175 ||


When we submit the distances between the corpora of the foregoing 17 Qal active roots to the
clustering algorithm, we obtain the following dendrogram:

The cophenetic correlation coefficient (CCC) is gratifyingly large (0.91), indicating that the
dendrogram reflects the computed distances rather well.


When the cut-level is at 5 lams, we obtain a prototypical 5-corpus movement cluster (“M”)
consisting of בוא ‘go in’, עלה ‘go up’, יצא ‘go out’, הלך ‘walk’, and ירד ‘go down’. Because the
algorithm has identified a cluster, the next step would be to investigate which properties these verb
corpora have in common, which would be the beginnings of producing a hierarchical lexicon of
verbs.


With regard to other aspects of the dendrogram, we note, among other things, that: 


* The words שכב ‘lie down’ and גור ‘sojourn’ are not prototypical movement verbs. A
complete analysis would seek the sources of their peculiarity.
* We see that פנה ‘turn’, עזב ‘leave’, and סור ‘turn aside’ form a loose cluster, being separated
by about 10 lams. These are not movement verbs of the sort in the prototypical cluster
(which involve an agent’s moving from A to B). Rather, they involve an agent’s changing
orientation (signaled by the mvt dir CIC).
* The other singleton clusters merit examination to identify the idiosyncrasies that lead to
their relative isolation with respect to the prototype clause sets.


In §17.2.2, we referred to geometrical methods of clustering but opted to use the clustering
approach relying on dendrograms. In this case, however, we now use a geometrical method¹⁹ to check
our results. When we plot the data for eight selected clause sets, the following diagram is the result:


The five smaller dots (•) above the origin of the horizontal (x-y) plane correspond to the “M”
cluster for בוא ‘go in’, עלה ‘go up’, יצא ‘go out’, הלך ‘walk’, and ירד ‘go down’ in the dendrogram. The
larger dot (●) is עזב ‘leave’, the square (■) is שכב ‘lie down’, and the triangle (▲) is גור ‘sojourn’.

17.3.2.2 Hiphil Active Verbs of Caused Movement 

We previously and independently found 2,347 Hiphil active verbs of caused motion. To allow 
for them to cluster differently from the Qal actives, we segregated them from the Qals. Thirteen of 
the Hiphil active forms appear more than 30 times (in 2,288 clauses, 97.5% of the total): 


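Root | Gloss | Count || Root | Gloss | Count || Root | Gloss | Count
בוא | bring in | 559 || | raise up | || נפל | make fall |
שוב | bring back | 353 || שלך | throw | || הלך | make walk | 46
יצא | bring out | || עמד | stand | || | |
עלה | bring up | 213 || עבר | make cross | 76 || | |
קרב | present | 185 || ירד | bring down | 67 || | |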


19. Technical note. We used classical multidimensional scaling to project the data into three-space. Just over
two-thirds of the variance is preserved in the plot. See Mark Davison, Multidimensional Scaling (New York: Wiley, 1983).

When we submit the distances among the corpora containing the foregoing 13 Hiphil active roots
to the clustering algorithm, we obtain the following dendrogram:

[Dendrogram: 13 Hiphil active verbs of caused movement. Cophenetic Correlation Coefficient = 0.88]

The cophenetic correlation coefficient is slightly less than that of the Qal active movement verbs.


17.4 Affinities for Corpora Based on Single Binyanim, Etc. 


As our final probe of verb corpora affinities in this chapter and as preparation for taking up 
quasiverbal-predicator clauses (QVCs) in chap. 18 and verbless clauses (VLCs) in chap. 19, we 
next present a gallery of 7 large-corpora CIC-incidence bar charts for self-study. We then display a 
dendrogram that portrays the affinities among these corpora. 


17.4.1 A Gallery of CIC Incidence Bar Charts
Here, we present a CIC-incidence bar chart for the sets of clauses based on:


* The Qal active verbs
* The Qal active verbs without היה


We omit the היה clauses because of the strong difference between היה and other verbs (see pp. 245ff.).
The main difference between these charts is the diminution of subject complement incidence in the
lower chart.

As the bar chart of CIC incidence for verbless clauses clearly reveals, they lack predicators 
(p. 247). 

Our glossary defines the quasiverbal as “a segment that does not have verb morphology but
functions as a predicator. Included are: behold! (הנה), exists (יש), still (עוד), not-exists (אין) and,
strictly speaking, where? (איה).” Because of their novel status, we have not included them among
the predicators in our bar charts. Instead, they are treated in their own right in chap. 18.
[Bar charts: CIC Incidence for Piels (DAs); CIC Incidence for Niphals (NP); CIC Incidence for Verbless Clauses (VLCs); CIC Incidence for Quasiverbals (QVs)]

17.4.2 Dendrogram Based on CIC Incidences


The table gathers the top 13 CIC incidences for the verb corpora that we have studied. 


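[Table: top 13 CIC incidences by corpus. Rows: anchored verb, inf. constr., participle, unanchored verb, verb particip., subject, direct object, indirect object, sbj/obj compl., location, negation, mvt aim, mvt origin. Most cell values are not recoverable. Legible values: anchored verb 32.5%; participle 2.8%, 3.1%; unanchored verb 57.7%, 53.1%; unplaced values 5.3%, 4.0%, 10.7%, 8.8%, 5.2%.]

a. The first column is for the Qal actives without היה.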
17.4.2.1 Clause Set Affinities, All CIC Incidences Included
When we carry out our standard clustering on 7 of the sets of clauses in the gallery above,²⁰ we
obtain this dendrogram:

[Dendrogram of the 7 clause sets. Legend: GA: Qal active; DA: Piel; HA: Hiphil; NP: Niphal; VLC: Verbless; QV: Quasiverbal; HYH: היה. CCC = 0.97]

This dendrogram does an excellent job of characterizing the distances separating the verb corpora,
because its cophenetic correlation coefficient is 0.97.

We see that the Piel (DA) and Hiphil (HA) corpora are closest to each other, with the Qal active
and Niphals joining in a bit farther away. As usual, the היה clauses “have a mind of their own.” The
verbless clauses and the quasiverbal clauses have affinities, being closer to each other than they are
to the other sets of clauses.


17.5 Brief Summary 


Generalization and the Hierarchical Lexicon. One path to useful generalizations regarding 
the verb stock of Biblical Hebrew is to locate each verb corpus in a taxonomy containing all 
verb corpora (“the hierarchical lexicon"). Statements made about any group of verb corpora (“verb 
classes") may then be powerful generalizations regarding its members. 

Inter-Clause Distance. We construct our hierarchical lexicon using the CIC incidence patterns 
associated with each verb corpus. This requires that we carefully define the distance between pairs 
of verb corpora, a technically demanding task. 

Inter-Clause Clustering. Given the table of distances separating the verb corpora, we are in a 
position to form groups of verb corpora using agglomerative clustering techniques. These methods 
find the hierarchy of verb corpora that best fits the table of distances. There is a statistic that then 
allows us to judge how well the hierarchy incorporates the distances on the basis of which it has 
been produced (the “cophenetic correlation coefficient"). 


20. We do not include the full set of Qal active clauses but, rather, separate the CIC incidences for היה from them.

Clustering the Top Twenty Qal-Active Verb Corpora. We cluster the corpora associated with
the 20 most frequent Qal active verbs and then show how the resulting tree can be used to group
the corpora.

Clustering Verbs of Movement. We carry out analogous operations for frequent verbs of
movement, first using 17 Qal actives and then using 13 Hiphil actives.

A Gallery of CIC Incidence Bar Charts. For reader study, we display an instructive gallery of
CIC incidence bar charts: Qal actives, Qal actives without the היה corpus, Piels, Hiphils, Niphals,
verbless clauses, and quasiverbals. We use these CIC incidences to cluster the binyanim, verbless
corpus, and quasiverbal corpus. We find, for example, that the verbless corpus has much in common
with the quasiverbal corpus, that the היה corpus stands alone, and that the Piels and Hiphils
have more in common with each other with regard to CIC incidence than they do with the Qal
actives and Niphals.


Chapter 18 


The Five Quasiverbals 


In this chapter, we examine characteristics of the five lexemes that we grouped under the head- 
ing Quasiverbals in our tables of parts of speech in §3.2. This list is reordered from §3.2.5: 


יש exists    אין does not exist    עוד still    הנה behold    איה where?


Approximately 720 of these lexemes combine with verbal elements to produce compound predica- 
tors. Because 75% of these involve the phenomenon of discontinuity, we discuss them in chap. 20. 
Here we deal with the 1,213 clauses in which these items appear as simple predicators. 

In this chapter, we address three issues: (1) the way the five lexemes are characterized in stan- 
dard reference works, (2) the patterns of CIC usage exhibited by each, and (3) the affinities that the 
five exhibit. 


18.1 Standard Biblical Hebrew References on the Five Lexemes


18.1.1 How They Are Described
יש exists. The lexeme יש has received several classifications:¹
* Lambdin: “predicator of existence”²
* IBHS: “quasi-verbal indicator” and “predicator of existence”³
* Van der Merwe et al.: “predicator of existence”⁴
* Joüon/Muraoka: “adverb of existence”⁵
* Williams: “existential particle”⁶


The fact that יש takes suffixes is frequently noted.⁷
אין does not exist. This lexeme⁸ has also received diverse classifications:


1. The homograph involving יש occurs once, in Prov 8:21. There, יש is a noun that is glossed ‘substance’.

2. Lambdin, 165 (all references to Hebrew grammars in this chapter are by author's name or by acronym only; 
consult the bibliography for full information). 

3. IBHS, 72 and 623. 
4. Van der Merwe et al., 320.

5. Joüon/Muraoka, 541.

6. Williams, 170–71.

7. BDB, 441; Lambdin, 165; Blau, 78n; Joüon/Muraoka, 320; Williams, 170–71.

8. Homographs: Seven times pX is classified as a noun, glossed ‘nothing’. Once px (Isa 40:23) is a noun, also 
glossed ‘nothing’. Forty-one times PX is a construct noun, glossed ‘nothing of’. 
* BDB: “particle of negation”⁹
* Lambdin: “predicator of non-existence”¹⁰
* IBHS: “clausal adverb” and “predicator of non-existence”¹¹
* Van der Merwe et al.: “negative adverb”¹²
* Joüon/Muraoka: “negative adverb” and “adverb of non-existence”¹³
* Williams: “[t]he negative”¹⁴


The fact that it takes suffixes is frequently noted.¹⁵
עוד still. The classifications of עוד¹⁶ are less scattered than those of several of the other
lexemes here considered. We find:


* BDB: “used mostly as adv. acc. still, yet, again, besides . . .”¹⁷
* Lambdin: “In verbal sentences, עוד is used as a simple adverb in the sense of ‘again, still,
yet, once more’.”¹⁸
* IBHS: “constituent adverb, qualifying the time extent of the predicate.”¹⁹
* HALOT: “[substantive] repetition, duration [develops into] adv. again, still (with fluid
transition) from one to the other.”²⁰
* Van der Merwe et al. as well as Joüon/Muraoka: “adverb.”²¹


The fact that the lexeme can take a pronoun suffix is repeatedly asserted.²²

הנה behold. The lexeme²³ has also received several distinct classifications:


* BDB: “demonstrative particle.”²⁴
* Lambdin: “predicator of existence emphasiz[ing] immediacy, the here-and-now-ness of the
situation.”²⁵
* IBHS: “particle,” “demonstrative adverb,” “deictic particle,” and “presentative.”²⁶
* HALOT: “deictic and interrupting interjection.”²⁷
* Van der Merwe et al.: “discourse marker.”²⁸



9. BDB, 34.

10. Lambdin, 165.

11. IBHS, 623.

12. Van der Merwe et al., 318.

13. Joüon/Muraoka, 306 and 541.

14. Williams, 146–48. Williams specifies eight different uses of the lexeme.

15. BDB, 34; Lambdin, 165; IBHS, 661; Williams, 146. Also, J. Blau, A Grammar of Biblical Hebrew (2nd ed.;
Wiesbaden: Harrassowitz, 1993) 78n.

16. We have divided this form into three homographs: two are temporal adverbs, one durational (glossed ‘still’,
317x) and the other repetitive (glossed ‘again’, 60x). The rest are classified as quasiverbals (116x).

17. BDB, 728.

18. Lambdin, 171.

19. IBHS, 657.

20. HALOT, 796.

21. Van der Merwe et al., 308, and Joüon/Muraoka, 307.

22. BDB, 728–29; Lambdin, 171; Blau, 78n; HALOT, 796; Joüon/Muraoka, 307.

23. We split the forms into three homographs: a spatial adverb (glossed ‘here’, 268x), an exclamative (glossed
‘behold’, 19x), and a quasiverbal (glossed ‘behold’, 912x).

24. BDB, 243.

25. Lambdin, 168.

26. IBHS, 300, 307, 635, and 675.

27. HALOT, 252.

28. Van der Merwe et al., 328.
* Joüon/Muraoka: “presentative adverb.”²⁹
* Andersen: “positive perspectival presentative predicator.”³⁰


Most resources note that this lexeme can and does take pronoun suffixes.³¹

איה where? It is typically stated or implied that איה is an interrogative or interrogative adverb.³²
That it can take suffixes is commonly noted.³³ It is asserted that it is not used with verbs:

* BDB: “used of both persons and things (but never with a verb [contrast איפה]).”³⁴
* IBHS: “not used with verbs.”³⁵
* HALOT: “never before verbs, always in direct questions.”³⁶


Against these assertions, we find the lexeme in five verbal clauses: Gen 16:8, 2 Sam 1:3, Jer 5:7,
Job 2:2, and Qoh 11:6. In these, we recognize a pure interrogative.


18.1.2 Verb-Like Behavior 


In the foregoing, the notion of the lexemes’ exhibiting predicative function appeared for יש ‘exists’,
אין ‘does not exist’, and הנה ‘behold’. Indeed, the concept of “quasi-verbal indicators” even
appeared, in the Waltke and O'Connor IBHS discussion of יש ‘exists’. Along these lines, Lambdin
remarks that יש and אין “approximate a verbal function in Hebrew, serving almost as tenseless
forms of the verb ‘to be’.”³⁷ He further comments that “עוד, like הנה, may be inflected and used as a
predicator of existence, with the nuance of ‘to still be, to yet be.’ . . . עוד, like הנה, may be extended
from purely existential predication to use in other types of non-verbal sentences.”³⁸ We are aware
of only a single dissenting voice, GKC:³⁹

The usual explanation of [adverbial] suffixes (especially of the forms with Nûn energicum) as verbal
suffixes, which ascribes some power of verbal government even to forms originally substantival (e.g.,
ישנו there is, he is), is at least inadmissible for forms . . . which are evidently connected with noun
suffixes; even for the other forms it is questionable.


No evidence for the final assertion is provided.


18.2 Existentials 


18.2.1 Biblical Hebrew Background 


The lexemes יש and אין are typically referred to as existentials. In Biblical Hebrew reference
works, there is general agreement about the verb-like behavior of these lexemes. We find:


29. Joüon/Muraoka, 351. 

30. F. I. Andersen, “Taxonomy and Translation of Biblical Hebrew הנה,” in Hamlet on a Hill: Semitic and Greek
Studies Presented to Professor T. Muraoka on the Occasion of His Sixty-Fifth Birthday (ed. M. E. J. Baasten and W. T.
van Peursen; Leuven: Peeters, 2003) 56.

31. BDB, 243; Lambdin, 168; Blau, 78n; IBHS, 675; HALOT, 252; Joüon/Muraoka, 307.

32. BDB calls it an “interrogative adverb,” 32, as do Joüon/Muraoka, 306. HALOT, 39, calls it an interrogative;
this is implied by IBHS, 328, and by van der Merwe et al., 326.

33. Lambdin, 172; Blau, 78n; HALOT, 39. It is suffixed in 9 of 83 occurrences.

34. BDB, 32. 

35. IBHS, 328. 

36. HALOT, 39. 

37. Lambdin, 165. 

38. Ibid., 171. 

39. GKC, §100o, 297.
* As noted earlier, van der Merwe et al. assert that יש and אין function as “predicators of
existence.”⁴⁰

* BDB comments that יש ‘exists’ “asserts existence, and so corresponds to the verb
substantive, is (are, was, were, will be). . . . On this word, see esp. [Nöldeke], who
exemplifies its different constructions in Semitic, and [shows] how it tends to pass into a
verb.”⁴¹


18.2.2 General Linguistics Background 

In general linguistics, there is a surprisingly substantial literature on existentials. Central to this
literature are the concepts of pivot⁴² and of a definiteness effect.⁴³ Careful study of pivots indicates
that the distinction between subjects and pivots is not valid for Biblical Hebrew. Nor do we see any
need for a definiteness effect, an impression reinforced by Francez, who refers to “languages like
Hebrew that do not exhibit a definiteness effect.”⁴⁴ Nonetheless, there are some observations in the
literature that are relevant to Biblical Hebrew:


* “By far the most common answer to the question of the semantic structure of existentials in
the literature is that existentials are semantically locative predications.”⁴⁵

* “[I]n many, and perhaps in all, languages existential and possessive constructions derive
(both synchronically and diachronically) from locatives. . . . [I]t might appear reasonable
to say that all existential sentences are at least implicitly locative (the term ‘locative’ being
taken to include both temporal and spatial reference).”⁴⁶

* Most Biblical Hebrew grammarians, we daresay, would agree with Falk’s assertions:⁴⁷


[Y]eš is often analyzed as if it were a verb, either the present of haya or something essentially
similar. . . . While the motivation is clear (the paradigmatic relation between haya and yeš), it is also
obviously the case that yeš is not a verb. It appears to be a noun, as does its negative eyn. . . . In all their
uses, yeš and eyn conform to our claim that they are categorically nouns but functionally verb-like.


* Falk also observes that “yeš is used in locative, existential, and possessive constructions.”⁴⁸


40. Van der Merwe et al., 318. In addition, אין “[n]egates events to which a participle refers.”

41. BDB, 441.

42. David Beaver et al., “Bad Subject: (Non-)canonicality and NP Distribution in Existentials,” Semantics and
Linguistic Theory 15 (2005), www.linguistics.ucla.edu/salt/.

43. E. L. Keenan, “The Definiteness Effect: Semantics or Pragmatics?” Natural Language Semantics 11 (2003)
187–216.

44. Itamar Francez, “Semantic structure and argument realization in (mostly Hebrew) existentials,” 2006. On the
web at: http://home.uchicago.edu/-ifrancez/IATLOT7.pdf. Note that Modern Hebrew is involved here.

45. Ibid., 3. Precisely for this reason, these sorts of location CIC might be considered core, not only in predicates
in verbless clauses, but also in היה clauses.

46. John Lyons, “A note on possessive, existential, and locative sentences,” Foundations of Language 3 (1967) 390.

47. Yehuda N. Falk, “The Hebrew Present-Tense Copula as a Mixed Category,” in Proceedings of the LFG04
Conference (ed. M. Butt and T. H. King; Stanford, CA: CSLI, 2004) 229 and 241. Note well that this essay deals with
Modern Hebrew. Some of its assertions are incorrect for Biblical Hebrew. For example, it refers to “the impossibility of
[yeš] appearing with the negative lo” (p. 231), but see Job 9:33, where we find: לא יש־בינינו מוכיח.

48. Ibid., 236. Compare Williams, 170–71.
18.2.3 CIC Incidences for Existentials in Biblical Hebrew
18.2.3.1 יש

The CIC-incidence bar chart documents that clauses containing יש commonly include a subject
CIC (almost 90% of the time) and include locations (about one-third of the time), possessors and
interrogatives (each about one-fifth of the time), and accompanier and aim/purpose CICs (9% and
6% of the time).


[Bar chart: CIC Incidence for יש. Categories: anchored verb, inf. construct, n-v participle, unanchored verb, verb participle, subject, direct object, indirect object, subj+obj cmpl., location, negation, movement aim, movement origin, possessor, interrogative, accompanier, aim/purpose]


It is useful to proceed further and observe the way that these CICs co-occur.⁴⁹


* There are 3 contexts (4 instances) where יש occurs alone, always in answers.⁵⁰

* There are 38 two-CIC clauses involving יש.⁵¹ More than half of these simply consist of
QV sbj.⁵²

* There are 47 יש-clauses with 3 noncoordinating conjunction CICs. Closed interrogatives
occur 3 times, always in lead position: interrog QV. Five additional times, יש is not in
lead position. Three of the CIC sequences following the 37 clauses with initial יש exhibit
alternating CIC order:


sbj loc 8    poss sbj 6    sbj acc 3
loc sbj 8    sbj poss 1    acc sbj 1


49. Note: The Aramaic equivalent of יש, איתי ‘exists’, is included in these tallies. In passing, we observe that the
Aramaic form is preceded by לא ‘not’ six times: Dan 2:10, 11 (2x); 3:29; 4:32; Ezra 4:16.

50. 1 Sam 9:12, 2 Kgs 10:15 (יש ויש), Jer 37:17, and Dan 3:1.

51. Two of these involve ellipted יש: Jer 5:1 and Prov 13:7. Eight have an introductory “and” (7x) or “also” (1x).

52. Sbj QV order occurs once, in 1 Sam 21:5.
* Thirty-nine יש-clauses consist of 4 CICs, with 38 containing a subject CIC. Fourteen begin
with the sequence interrog QV, and a closed interrogative is used in all 14 of these.
* Six clauses consist of 5 or more CICs, too few clauses for useful generalizing.
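
Tallies of this kind are straightforward to reproduce once each clause is recorded as its sequence of CIC labels. The sketch below is illustrative only: the six clause records are hypothetical, the label strings merely follow the abbreviations used above, and the exclusion of coordinating conjunctions from the CIC count is assumed to have been applied already.

```python
from collections import Counter

# Hypothetical clause records: the CIC labels of each clause in surface order.
clauses = [
    ("QV",),
    ("QV", "sbj"),
    ("QV", "sbj", "loc"),
    ("QV", "loc", "sbj"),
    ("QV", "sbj", "loc"),
    ("interrog", "QV", "sbj", "loc"),
]

# How many clauses have 1, 2, 3, ... CICs.
by_length = Counter(len(clause) for clause in clauses)

# For three-CIC clauses with the quasiverbal in lead position,
# count each ordering of the two CICs that follow it.
qv_initial_tails = Counter(
    clause[1:] for clause in clauses if len(clause) == 3 and clause[0] == "QV"
)

print(by_length)
print(qv_initial_tails)
```

Comparing the counts for a sequence and its mirror image (for example `("sbj", "loc")` against `("loc", "sbj")`) gives the kind of alternating-order table shown above.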


18.2.3.2 אין

We turn next to the negative polarity existential, אין ‘does not exist’. This lexeme appears in
compound verbal structures 129 times. We focus here on its 594 appearances as a clausal CIC in its
own right. The CIC incidence bar chart reveals the sorts of CICs that appear along with it. We see
that clauses containing אין commonly include a subject CIC (about 90% of the time) and include
locations (26% of the time), possessors (18% of the time), and aim/purpose and accompanier CICs
(each around 6% of the time). Note that, unlike the situation with יש (21.1%), there are fairly few
interrogatives with אין (3.4%).


[Bar chart: CIC Incidence for אין. Categories: anchored verb, inf. construct, n-v participle, unanchored verb, verb participle, subject, direct object, indirect object, subj+obj cmpl., location, negation, movement aim, movement origin, possessor, aim/purpose, accompanier]


It is interesting to observe how these CICs co-occur. 


* There are 18 contexts where אין occurs alone in a clause. Six of these involve אם־אין,⁵³ and
2 involve כי־אין.⁵⁴

* There are 204 two-CIC clauses involving אין.⁵⁵ More than 85% of these simply consist of
QV sbj.

* There are 255 אין clauses with 3 noncoordinating conjunction CICs. The existential is in
lead position 71% of the time. Two CIC sequences following the 179 clauses with initial אין
exhibit alternating CIC ordering:


53. Exod 17:7, 32:32; Judg 9:15, 9:20; 2 Sam 17:6; 2 Kgs 2:10. 

54. 1 Sam 10:14; Job 35:15. 

55. Two of these involve ellipted w7: Jer 5:1 and Prov 13:7. Eight of these have an introductory “and” (7x) or “also” 
(1x). 
sbj loc 41    poss sbj 31
loc sbj 22    sbj poss 15


* 95 אין-clauses consist of 4 CICs, with 91 of these (96%) containing a subject CIC.
* 20 clauses consist of 5 or more CICs, each exhibiting a unique sequence of CICs.

18.3 עוד


The CIC-incidence bar chart for עוד highlights the range of CICs observed in its 86 appearances
as a noncompounded clausal predicator.

[Bar chart: CIC Incidence for עוד. Categories: anchored verb, inf. construct, n-v participle, unanchored verb, verb participle, subject, direct object, indirect object, subj+obj cmpl., location, negation, movement aim, movement origin, interrogative, possessor, time point, time goal]


It is interesting to observe how these CICs co-occur: 


* There are no QV-only clauses. 

* Two-CIC clauses occur 22 times, 19 (86%) being QV sbj. 

* There are 41 three-CIC clauses. These are QV-initial 90% of the time. 
* There are 22 four-CIC clauses. Five (23%) are interrog QV sbj sbj-cmp. 
* The 6 clauses having 5 CICs share little with regard to CIC sequencing. 


18.4 הנה


This CIC-incidence bar chart for הנה highlights the limited range of CICs observed in its 350
appearances as a noncompounded clausal predicator.

[Bar chart: CIC Incidence for הנה]

A subject appears in 92% of the clauses, while subject complements and locations appear in almost
30%. Comparison and vocative CICs each appear in slightly over 5% of the clauses.
As for the co-occurrence patterns of the CICs, we find: 


* There are no QV-only clauses. 

* There are 107 two-CIC clauses, all but 1 being QV-initial. The preponderant sequence is
QV sbj (81%).

* There are 140 three-CIC clauses, all but 3 being QV-initial (98%). For the 137 QV-initial 
clauses, the patterns of the final 2 CICs primarily are: 


sbj sbj-cmp 39 sbj loc 33 
sbj-cmp sbj 12 loc sbj 9 


* There are 74 four-CIC clauses exhibiting around 50 patterns. 
* There are 19 five-CIC clauses, 3 six-CIC clauses, and 1 seven-CIC clause.⁵⁶


18.5 איה


This CIC-incidence bar chart for איה highlights the limited range of CICs observed in its 59
appearances as a noncompounded clausal predicator.


56. The clause is in Ps 123:2. Four of its CICs are “pulled up" from the following clause. 
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The CIC co-occurrence patterns are: 


* There is 1 context where אין occurs alone in a clause: Job 15:23. All but 2 of the clauses
contain a subject.

* There are 48 two-CIC clauses involving אין, all consisting of QV sbj.

* There are 8 אין clauses with 3 non-coordinating-conjunction CICs. The lexeme is in lead
position in all but one, Zech 1:5.

* One אין clause consists of 4 CICs: Jer 5:7.


18.6 Affinities among the Five Lexemes 


We naturally wonder what affinities exist among the quasiverbals. Following the procedures 
explained in chap. 17 and more rigorously in appendix 5, we can use the CIC incidences for the 5 
quasiverbal lexemes to compute the distances separating the lexemes and then use these to perform 
hierarchical clustering. We obtain the dendrogram shown on p. 260. The cophenetic correlation 
coefficient for the tree is 0.94, indicating an excellent fit to the data. Note that the 2 existentials 
are closest together, being separated by 5 lam units. To this prototypical cluster, הנה joins in; then
comes עוד, followed by איה. Put differently, the existentials are the most prototypical of the
quasiverbals, while איה is the least.
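
The clustering step can be illustrated with a minimal sketch. It is not the procedure of chap. 17 or appendix 5, which is not reproduced in this section: the distance measure (Euclidean), the linkage rule (average linkage), the lexeme labels A-E, and every number in the toy profiles are assumptions made only for illustration. The sketch builds the merge history and then computes a cophenetic correlation coefficient, that is, the Pearson correlation between the original pairwise distances and the heights at which each pair is first joined in the tree.

```python
from itertools import combinations
from math import sqrt

# Toy incidence profiles (percent of clauses containing each of three CICs).
# The labels A-E and all numbers are invented for illustration only.
profiles = {
    "A": [90, 30, 0],
    "B": [92, 26, 2],
    "C": [85, 30, 15],
    "D": [70, 20, 40],
    "E": [40, 5, 10],
}


def euclidean(u, v):
    """Euclidean distance between two incidence profiles (an assumed measure)."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def average_linkage(points):
    """Agglomerative clustering with average linkage.

    Returns the merge history, the original pairwise distances, and the
    cophenetic distances (the height at which each pair is first joined).
    """
    dist = {frozenset(pair): euclidean(points[pair[0]], points[pair[1]])
            for pair in combinations(points, 2)}

    def link(a, b):
        # Mean distance over all cross-cluster pairs.
        return sum(dist[frozenset((x, y))] for x in a for y in b) / (len(a) * len(b))

    clusters = [frozenset([name]) for name in points]
    merges, cophenetic = [], {}
    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2), key=lambda pair: link(*pair))
        height = link(a, b)
        for x in a:
            for y in b:
                cophenetic[frozenset((x, y))] = height
        merges.append((sorted(a), sorted(b), height))
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return merges, dist, cophenetic


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


merges, dist, cophenetic = average_linkage(profiles)
pairs = list(dist)
coph_corr = pearson([dist[p] for p in pairs], [cophenetic[p] for p in pairs])

for left, right, height in merges:
    print(f"join {left} + {right} at height {height:.2f}")
print(f"cophenetic correlation: {coph_corr:.2f}")
```

With these toy profiles the two most similar items join first and the most idiosyncratic item joins last, which mirrors the shape of the statement above without reproducing its data. A coefficient near 1 means that the tree heights track the original distances closely.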


18.7 Brief Summary 


Standard Treatments of Quasiverbals. For some of the items that we group as quasiverbals, 
the standard Biblical Hebrew literature recognizes predicative functions. This is especially true of 
the existentials. But the 5 lexemes are not standardly grouped into a family. As far as we know, our
work is the first time that these 5 lexemes have been grouped together as a distinct part of speech. 

Patterns of Quasiverbal Usage. When we examine its CIC incidence bar charts, we find that 
the quasiverbal corpus is uniform in its preference for explicit subjects. While the location SR is 
fairly frequent, explicit negation is rare. Subject complements occur fairly frequently with עוד ‘still’
and הנה ‘behold’, but not with the other 3.

Quasiverbal Affinities. When we cluster the quasiverbals, we find that יש ‘exists’ and אין ‘does
not exist’ (the existentials) are most similar, with הנה ‘behold’ more distant, עוד ‘still’ even more
distant, and איה ‘where?’, the mixed lexeme, most idiosyncratic of all.


Chapter 19 


Verbless Clauses 


We here discuss the major issues pertaining to clauses having neither verbals nor quasiverbals.¹


19.1 Verbless/Nominal Clauses 


19.1.1 Naming Conventions 
The special kinds of clausal entity that we are about to consider have two naming conventions 
based on the characteristics of their clausal predicates: 


* The older convention focuses on what the predicates supposedly contain, following
Albrecht in calling the clauses Nominalsatzen (“nominal sentences”) or nominal clauses.²

* The more recent convention emphasizes what the predicates lack, following Andersen in
calling the clauses verbless clauses or nonverbal clauses.³


The former convention is preferred by scholars in the European tradition, the latter by those in the
North American tradition. In a survey volume on our topic containing essays by 12 contributors, 5
of 6 European contributors refer to the clauses as nominal, while 5 of 6 North American
contributors refer to the clauses as verbless.⁴ The reference works fluctuate in use between “nominal clause”
and “verbless clause”:


* Gesenius/Kautzsch uses “die Nominalsätze,”⁵ which Cowley translates “noun-clause.”⁶

* For Joüon's “Proposition nominale,”⁷ Muraoka adopts the English term “nominal clause”
and retains Joüon's mention of both conventions: “The category of nominal clause includes
every clause the predicate of which is a noun or the equivalent of a noun, i.e., a participle,
a preposition with a noun or pronoun, etc.; or, put negatively, every clause the predicate of
which is not a verb . . . is a nominal clause.”⁸


1. We might write predicator-less structures rather than verbless clauses, since a few of the structures to be
considered are not clauses, in that they lack predicates. These structures are dealt with in §19.9. For definitions of clause, see
§7.1. From our four types of participles (§3.2.4.2), we include only the pure noun participles.

2. C. Albrecht, “Die Wortstellung im hebräischen Nominalsatze,” ZAW 7 (1887) 218–24; 8 (1888) 249–63.

3. F. I. Andersen, The Hebrew Verbless Clause in the Pentateuch (Nashville: Abingdon, 1970).

4. Cynthia L. Miller, ed., The Verbless Clause in Biblical Hebrew: Linguistic Approaches (Linguistic Studies in
Ancient West Semitic 1; Winona Lake, IN: Eisenbrauns, 1999).

5. W. Gesenius and E. Kautzsch, Hebräische Grammatik (27th ed.; Leipzig: Vogel, 1902) 458. On the Web at
books.google.com.

6. GKC, 451. 

7. Joüon, 466. 

8. Ibid., 528. 
* Waltke and O'Connor have a chapter on verbless clauses.⁹

* Van der Merwe et al. opt for “nominal clause” defined by what is absent: “Nominal clauses
refer to clauses in [Biblical Hebrew] that do not contain a finite form of the verb.”¹⁰

* In his index under “nominal clause,” Williams has “See verbless clause.”


Name choices are nearly equally divided between the two names. Where do we come down? We 
agree with Baasten: 


[T]he predicate of a non-verbal clause is not necessarily a noun phrase. As no-one seems to deny,
the predicate of such clauses may also be, among other things, a prepositional phrase or an adverbial
phrase, constituents that cannot properly be called ‘nominal.’¹¹


Dryer states that “there are three types of clauses with nonverbal predicates whose properties
vary considerably across languages. These are adjectival predicates, nominal predicates, and
locative predicates.”¹² Biblical Hebrew includes these three sorts of nonverbal predicates, and more.
Hence, we will continue to gather every kind of clause that lacks a verbal or quasiverbal
predicator under the term “verbless clause.” We will sometimes abbreviate “verbless clause” as VLC
(“v?lick”).

Cameron Sinclair sees introduction of the nominal/verbless clause type as unnecessary. He
seeks a “single unified description of the syntax of clauses employing the copula היה and of
nominal clauses.”¹³ To achieve this goal, Sinclair identifies and provides examples of a range of
constituent types that are “complements both in clauses with היה and in verbless clauses.”¹⁴ Based on these
parallels, he asserts that היה clauses and nominal clauses “are really not two clause-types at all but,
rather, variants of a single type in which the verb occurs when it is needed ... but is otherwise
simply omitted.” The fact that a set of constituents can be defined as appearing both in היה clauses
and in verbless clauses is not a sufficient condition for declaring the two “variants of a single type.”
To make this declaration, one must show in full detail that they are typological siblings.

19.1.2 Accepted Properties of Verbless Clauses 

Although the choices of name for the clause type are nearly evenly divided, there is near
universal agreement regarding several characteristics of VLCs:¹⁵


1. Most VLCs contain two CICs. The orthodox view is that verbless/nominal clauses
are bipartite. Something like this is assumed in most reference works. A few references
deal with tripartite verbless clauses, a controversial topic touched on in §19.11. As for


9. IBHS, 125–35. Waltke and O'Connor are aware of the multiplicity of possible predicators that appear in
verbless clauses (pp. 72–73).

10. Van der Merwe et al., 361. Several times they include both terms, “nominal clause (or verbless clause),” for
example, on p. 63.

11. Martin F. J. Baasten, The Non-Verbal Clause in Qumran Hebrew (Ph.D. diss., Leuven University, 2006) 14–15.

12. Matthew S. Dryer, “Clause Types,” in Language Typology and Syntactic Description, Vol. 1: Clause Structure
(2nd ed.; ed. Timothy Shopen; Cambridge: Cambridge University Press, 2007) 224.

13. Cameron Sinclair, “Are Nominal Clauses a Distinct Clause Type?” in The Verbless Clause in Biblical Hebrew
(ed. C. Miller; LSAWS 1; Winona Lake, IN: Eisenbrauns, 1999) 52.

14. Ibid., 75.

15. When checks of the asserted properties of verbless clauses are easily feasible, our results appear as underlined
text.

16. IBHS, 131; van der Merwe et al., 252; Joüon/Muraoka, 538–43; Williams, 50.
non-two-CIC VLCs, the literature is basically silent.¹⁷ We break the silence in §19.9 and
§19.11–12. In our mark-up of the verbless clauses, 57% (5,453/9,500) have only two
CICs.

2. Analysis is formulated in terms of subjects and predicates. This is the reigning
paradigm.¹⁸

3. The subject of a VLC is “generally” a substantive or pronoun. Typically, a writer
provides a few examples of the major constituents and then of the less common
constituents.¹⁹ We find that 56.4% of the subjects of VLCs are phrasal substantives, 14.7%
are segmental substantives, and 28.0% are pronouns, leaving 0.9% for all other kinds of
constituents.

4. The predicate of a VLC can be any of several constituent types. The predicate may be
a substantive, adjective, participle, numeral, pronoun, adverb, or prepositional phrase
specifier of time, place, quality (according to GKC).²⁰ We agree with this assertion for
reasons that will become clear in §19.9–§19.11.

5. Substantive-substantive VLCs identify or classify²¹ their subjects. This distinction
originated independently with Andersen and Muraoka²² and has been universally adopted.²³
A clause of identification tells who or what is the subject, while a clause of classification
specifies an attribute or a relationship of the subject.²⁴

6. Identification clauses tend to be S + P, while classification clauses tend to be P + S. In
these assertions, much hinges on the specifics of “tend.”²⁵ Various factors can affect the
ordering of the two major constituents.²⁶ See §19.10.2.4.

7. Other testable generalizations are found. We check these two generalizations:


a. “The subject usually stands first in the clause.”²⁷ Across all of the structures without
predicators, this is not true. Only 43% of them lead off with their subject.


17. Regarding monopartites, see the very brief passing references in Joüon/Muraoka, 528, 529, 537. There is a
three-page chapter on quadripartite nominal clauses in Wido van Peursen, Language and Interpretation in the Syriac
Text of Ben Sira (Leiden: Brill, 2007) 306–9.

18. So GKC, 450; IBHS, 130; van der Merwe et al., 252; Joüon/Muraoka, 528; Williams, 206.

19. So GKC, 451 (“may be substantive or pronoun”) and Joüon/Muraoka, 528. Joüon, in the first edition,
comments: “De plus, le sujet peut être: (1) Une préposition avec son nom (ou pronom). . . . (2) Un infinitif construit. . . .
(3) Rarement un infinitif absolu: Pr 25,27” (pp. 466–67).

20. GKC, 451–52; van der Merwe et al., 248, 275; Joüon/Muraoka, 529, add infinitive construct; Williams,
199–200.

21. Muraoka refers to “description” rather than classification.

22. Francis I. Andersen, The Syntax of Biblical Hebrew (Ph.D. diss., Johns Hopkins University, 1960). The chapter
on verbless clauses in Genesis was expanded and published as The Hebrew Verbless Clause in the Pentateuch.
Takamitsu Muraoka, Emphasis in Biblical Hebrew (Ph.D. diss., Hebrew University, Jerusalem, 1969). Published as Emphatic
Words and Structures in Biblical Hebrew (Jerusalem: Magnes / Leiden: Brill, 1985).

23. So IBHS, 130; van der Merwe et al., 248; Joüon/Muraoka, 530; Williams, 198, 206.

24. IBHS, 130 and 132.

25. IBHS, 130–35; Williams, 207; Baasten, Non-Verbal Clause, 72–79.

26. Ordering is said to be conditioned on emphasis or prominence (notoriously elastic concepts) in both GKC, 454
(“predicate-subject . . . must be used when emphasis is laid on the predicate”) and Joüon/Muraoka, 532–36. This
explanation goes back to Albrecht, well summarized by Baasten, Non-Verbal Clause, 63–66. We read: “A personal pronoun
tends to occupy the second slot when no prominence is intended to be given to it” (Joüon/Muraoka, 532).

27. Van der Merwe et al., 248. 
b. “In purely statistical terms, the word order within the nominal clause is often
S-P, which is true in roughly two out of every three cases.”²⁸ As stated, this is a
weak assertion. Working only with structures having adjacent subjects and subject
complements, we find that the order is S-P in 58% of cases.
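
The figures quoted in this list can be rechecked with a few lines of arithmetic. The sketch below is only a consistency check on numbers stated above; the variable names are ours, and nothing in it comes from the underlying database.

```python
# Counts and shares quoted in the list above.
two_cic_vlcs, all_vlcs = 5453, 9500
subject_shares = {
    "phrasal substantive": 56.4,
    "segmental substantive": 14.7,
    "pronoun": 28.0,
    "other": 0.9,
}

# Share of VLCs having exactly two CICs.
two_cic_pct = 100 * two_cic_vlcs / all_vlcs

# The four subject categories should exhaust the subjects.
total_share = round(sum(subject_shares.values()), 1)

# "Two out of every three" versus the observed S-P share of 58%.
claimed_sp_pct = 100 * 2 / 3
observed_sp_pct = 58
gap = claimed_sp_pct - observed_sp_pct

print(f"two-CIC VLCs: {two_cic_pct:.1f}%")
print(f"subject shares sum to {total_share}%")
print(f"claimed minus observed S-P share: {gap:.1f} points")
```

The last line makes the size of the disagreement in item 7b explicit: the quoted generalization and the observed share differ by several percentage points.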
19.2 The Layered Structure of Verbless Clauses 


One may probe verbless clauses using ideas from Van Valin’s role and reference grammar 
(RRG): 


[RRG] posits three main representations: (1) a representation of the syntactic structure of sentences,
which corresponds closely to the actual structural form of utterances [the layered structure of the
clause], (2) a semantic representation representing important facets of the meaning of linguistic
expressions, and (3) a representation of the information (focus) structure of the utterance, which is
related to communicative function.²⁹


Van Valin’s “layered structure of the clause” was devised to describe verbal clauses: 


On this view, the primary constituent units of the clause are the ‘nucleus’, which contains the
[predicator] (usually a verb), the ‘core’, which contains the nucleus and the arguments of the [predicator], and
a ‘periphery’, which subsumes non-arguments of the [predicator], e.g., setting locative and temporal
phrases.³⁰


The layered structure may be schematized as follows:


Clause
    Lead Zone
    Core
        Prenucleus Zone
        Nucleus (Predicator)
        Postnucleus Zone
    Periphery


We have extended the structure by converting the argument slot into a prenucleus zone and a
postnucleus zone and by changing “precore slot” to a less theory-specific “lead zone” and “argument”
to “zone.” We introduce the layered structure model here because it approximates the sort of model
assumed in the traditional verb-centric Biblical Hebrew literature. We will see that this model is of
limited usefulness when verbless clauses are being described.

According to orthodoxy, in this representation the subject of an identification clause is in the 
prenucleus zone, while the subject of a classification clause is in the postnucleus zone. To maintain 
contact with the linguistics literature, we have kept the left-to-right ordering of the zones, even 
though for Biblical Hebrew texts the zones are ordered right-to-left, and for our phrase markers, 
top-to-bottom. 
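
As a purely illustrative aid, the five zones and the orthodox placement of subjects can be written down as a small data structure. The class name, field names, and the helper `place_subject` are hypothetical; placing the periphery last in the linear order is an assumption of the sketch, not a claim about RRG or about Biblical Hebrew.

```python
from dataclasses import dataclass, field


@dataclass
class LayeredClause:
    """Five zones of the extended layered structure (names are ours)."""
    lead_zone: list = field(default_factory=list)
    prenucleus: list = field(default_factory=list)
    nucleus: list = field(default_factory=list)
    postnucleus: list = field(default_factory=list)
    periphery: list = field(default_factory=list)

    def linear_order(self):
        # Left-to-right ordering of the zones, as in the schematic.
        return (self.lead_zone + self.prenucleus + self.nucleus
                + self.postnucleus + self.periphery)


def place_subject(kind):
    """Orthodox placement: identification gives S + P, classification gives P + S."""
    clause = LayeredClause(nucleus=["predicate"])
    if kind == "identification":
        clause.prenucleus.append("subject")
    elif kind == "classification":
        clause.postnucleus.append("subject")
    else:
        raise ValueError(f"unknown clause kind: {kind}")
    return clause


print(place_subject("identification").linear_order())
print(place_subject("classification").linear_order())
```

The sketch presupposes that the nucleus is already known, and it takes the clause kind as given rather than deriving it.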

A VLC may differ from other VLCs with regard to the contents of any of its constituents:
(1) lead zone, (2) prenucleus zone, (3) nucleus (predicator) zone, (4) postnucleus zone, and (5)
periphery. The number of possible combinations of these elements in verbless clauses is enormous,
and their communicative functions are myriad. When clauses contain neither verbal nor quasiverbal
constituents, these zonal distinctions are neither very helpful nor very convincing. Consider:


* Nucleus. The linchpin nucleus is no longer identifiable on the basis of morphology.
It must be identified with reference to its surroundings, be they local (using relative
definiteness clues within the structure) or more remote (using “topic” [old information] and
“comment” [new information] distinctions based on surrounding discourse).

* Arguments. It makes little sense to refer to “the arguments of the predicator” when said
predicator is, say, a noun phrase.

* Periphery. While Van Valin specifies that locative and temporal phrases are in the
periphery, Dryer states that nonverbal clauses can have “adjectival predicates, nominal
predicates, and locative predicates.”³¹ So, should locatives in verbless clauses be assigned
to a periphery, or to a core, or now to one and then to the other?


28. Joüon/Muraoka, 531.

29. Robert D. Van Valin Jr., Exploring the Syntax-Semantics Interface (Cambridge: Cambridge University Press,
2005) 1. See also our §12.5.1.

30. Van Valin, Exploring, 4. We have adjusted terminology to comport with our usage.


19.3 Elliptic Verbal Clauses 


Before examining the verbless clauses, we touch on clauses that masquerade as verbless. The 
representation of many of these involves non-tree phrase markers, so they are treated in chap. 20. 
19.3.1 Adjacent Ellipted Predicators 

Many of these sorts of clauses are “completed” by pulling in a nearby ellipted predicator. Phrase 
marker (19.1) from Isa 64:9 illustrates the sort of behavior involved: 


(19.1) [Phrase marker, Isa 64:9: two juxtaposed clauses sharing one verb. Glosses: Zion / desert / she was / Jerusalem / devastation]


As represented, the first clause “pulls in” its verb from the second clause. The NJPS puts the verb in
the first clause: “Zion has become a desert, Jerusalem a desolation.” In both analyses, these sorts
of clauses require non-tree representations (since some nodes have more than one mother) and so
are discussed in §20.2.3.


19.3.2 Nearby Predicators 

There are clauses whose completion depends not on a nearby predicator (typically
concordant) but rather on a more remote (typically discordant) predicator. This phenomenon often
occurs in conversation. In phrase marker (19.2) from 1 Kgs 20:14, the predicator in the question (יאסר
‘he will bind’) suggests an (ellipted) predicator for the answer (‘thou[m] wilt bind’) in (19.3):


31. Dryer, “Clause Types,” 224. 
(19.2) [Phrase marker, 1 Kgs 20:14, question. Glosses: and / he said / he will bind / the / war]

(19.3) [Phrase marker, 1 Kgs 20:14, answer. Glosses: and / he said / thou(m)]


19.4 Dealing with Rare CICs and Syntactic Isolates 


For our analyses, we set aside clauses containing any CIC that occurs fewer than 30 times and 
also ignore all instances of syntactic isolate CICs, retaining the reduced clauses. For the record, we 
here report on rare CICs and syntactic isolates in clauses without predicators. 


19.4.1 Instances of Rare CICs 


The table (see p. 267, top) provides a census of the 24 CIC types that occur fewer than 30 
times in the 9,500 VLCs. The 223 rare CICs identify 221 clauses for exclusion (they appear in 2.3% 
of the VLCs). 
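
The exclusion rule just described (set aside every clause containing a CIC type that occurs fewer than 30 times) can be sketched as follows. The function name, the representation of a clause as a list of CIC labels, and the toy corpus are assumptions made for illustration; only the threshold of 30 is taken from the text.

```python
from collections import Counter

RARE_THRESHOLD = 30  # "fewer than 30 times", as stated above


def split_by_rare_cics(clauses, threshold=RARE_THRESHOLD):
    """Separate clauses that contain any rare CIC type from the rest.

    `clauses` is a list of lists of CIC labels (a hypothetical encoding).
    Returns (kept, excluded, rare_types).
    """
    counts = Counter(label for clause in clauses for label in clause)
    rare = {label for label, n in counts.items() if n < threshold}
    kept = [c for c in clauses if not rare.intersection(c)]
    excluded = [c for c in clauses if rare.intersection(c)]
    return kept, excluded, rare


# Toy corpus: the labels are real CIC names, the counts are invented.
toy = ([["sbj", "sbj cmp"]] * 40
       + [["sbj", "loc"]] * 35
       + [["sbj", "cost"]] * 2
       + [["cost", "quoter"]])

kept, excluded, rare = split_by_rare_cics(toy)
print(sorted(rare), len(kept), len(excluded))
```

In the toy corpus, four rare CIC instances identify only three clauses, because one clause holds two of them. The same effect explains why 223 rare CICs identify 221 clauses above.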


19.4.2 Instances of Syntactic Isolates 

Syntactic isolates are simply removed from the clauses in which they appear, leaving the clauses
in play unless a “clause” consists of syntactic isolate(s) and, possibly, a coordinating conjunction.
There are 344 structures that consist of a single syntactic isolate (110 solo exclamatives, 201 solo
labels, and 33 solo vocatives). Eighty-two structures consist of 2 syntactic isolates. In 3 cases, a
single isolate follows a coordinating conjunction.³² Three syntactic isolates make up 2 structures, in
Isa 30:1 (containing an extended vocative having 22 segments) and in Jer 23:1 (containing a
vocative having 8 segments). A coordinating conjunction plus 2 labels appears in 1 Chr 26:25.

To see what these clauses are like, consider phrase markers (19.4) [solo exclamative] from Num
5:22, (19.5) [solo label] from Isa 13:1, and (19.6) [solo vocative] from 1 Sam 3:16.

Of the 214 exclamatives in verbless clauses, 205 are initial, 5 are medial,³³ and 4 are final.³⁴ Of
the 364 labels in these clauses, only 8 are medial.³⁵ The 297 vocatives are much less positionally
concentrated, with 33 solo, 52 initial, 164 final, and 48 medial.
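
A positional census of this kind can be sketched in a few lines. The four position names follow the text; the function names and the encoding of a structure as a list of CIC labels are hypothetical, and the toy structures are invented. The closing assertions simply confirm that the reported breakdowns add up to the reported totals.

```python
from collections import Counter


def position(index, length):
    """Classify a CIC's place in its structure as solo, initial, final, or medial."""
    if length == 1:
        return "solo"
    if index == 0:
        return "initial"
    if index == length - 1:
        return "final"
    return "medial"


def tally_positions(structures, target):
    """Count the positions occupied by CICs labelled `target`."""
    tally = Counter()
    for cics in structures:
        for i, label in enumerate(cics):
            if label == target:
                tally[position(i, len(cics))] += 1
    return tally


# Invented toy structures, one vocative in each non-solo position.
toy = [
    ["excl"],
    ["voctv", "sbj", "sbj cmp"],
    ["sbj", "voctv", "sbj cmp"],
    ["sbj", "sbj cmp", "voctv"],
]
voc_tally = tally_positions(toy, "voctv")
print(dict(voc_tally))

# Consistency of the totals reported in the text and in the footnotes.
assert 205 + 5 + 4 == 214        # exclamatives
assert 201 + 62 + 93 + 8 == 364  # labels (solo, initial, final, medial)
assert 33 + 52 + 164 + 48 == 297 # vocatives
```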


32. Num 2:14 (“and” + lbl), 26:4 (“and” + lbl); and Prov 8:32 (“and” + excl).

33. Gen 20:12; Ps 8:2, 10; Prov 15:23; 16:16.

34. 1 Sam 25:26; Ps 84:13, 137:8; Prov 29:18.

35. While 201 are solo, 62 are initial, and 93 are final. All of the initial and final instances might well have been
made solo. The 8 medial instances are in Exod 34:14; Jer 1:19, 30:11; Ps 51:1, 52:1, 54:1, 56:1, 60:1.
CIC Type               Count     CIC Type                Count     CIC Type          Count
Undesired outcome      20        Includer                11        Lapsus calami     3 [a]
Exocentric absolute    20        Nebulous                11        Number of times   2 [b]
Involved ones          16        Object of address       10        [Restricter]      2 [c]
Movement bearing       16        Movement interval        9        [entry lost]        [d]
Quantity               16        Area                     7 [e]    Condition         1 [f]
Composition            15        Ruled-over one           7 [g]    Separation        1 [h]
Agential               14        Result                   6 [i]
Quoter                 12        Alternate / surrogate    6 [j]
Cost                   12        Deprivation              4 [k]


a. Found in Gen 30:11, Ps 55:16, and Prov 27:24.

b. A curious form, subj CIC plus “number of times” CIC, occurs in 1 Kgs 7:4.

c. Found in Deut 4:6 and Judg 19:20.

d. There is a phrasal cause in 2 Chr 29:25b.

e. Found in Num 35:4; Judg 1:36; Ezek 40:16; 41:9, 15; 43:12; Zech 9:10.

f. There is an embedded conditional finite-verb clause (“if you-do-right”) in Gen 4:7.

g. Solo in Gen 43:16; initial in 1 Kgs 16:9; final in 1 Chr 9:31, 33; 23:28; 26:29, 30.

h. Found in 2 Sam 3:28.

i. Found in Isa 56:7; Ezek 17:9; Prov 14:23, 17:21; Song 5:9; 2 Chr 26:18.

j. Found in Gen 30:2, 50:19; Josh 2:14; Isa 61:7; Zeph 2:10; Prov 21:18.

k. Found in Deut 3:5, 4:12; Jer 51:5; and Job 21:9.


(19.4) [Phrase marker, Num 5:22, solo exclamative. Glosses: and / she will say / the / woman / amen / amen]

(19.5) [Phrase marker, Isa 13:1, solo label. Glosses: burden of / Babylon / which / he had a vision / Yeshayahu / son of / Amoz]

(19.6) [Phrase marker, 1 Sam 3:16, solo vocative. Glosses: and / he said / (name) / son / me]

19.4.3 Tally of Censored VLCs 


After we set aside clauses that contain rare CICs, ignore syntactic isolates, and suppress
coordinating conjunctions, the CIC census of the 8,850 remaining VLCs is as follows.


[Table: Number of CICs against Verbless Clause Tally, in two column pairs. Only fragments survive: “6 11”, “1”, “3 0.3%”, “8”, “3”.]


19.5 Atypical Subjects and Subject Complements in Verbless Clauses 


By a sizable margin, the two most frequently occurring CICs in verbless clauses are subjects
(7,348x) and subject complements (5,016x).³⁶ Only 66 of the subjects (0.9%) are nonsubstantives,
being prepositional phrases. Only 52 of the subject complements (1.0%) are nonsubstantive, also
prepositional phrases. As the table documents (see p. 269, top), 5 main prepositions are involved.

The 12 instances of a CIC consisting of a prepositional phrase involving the nota accusativi 
make up a puzzling cohort. Consider the example shown in phrase marker (19.7) from Judg 20:44. 


(19.7) [Phrase marker, Judg 20:44. Glosses: [nota acc.] / all of / these / men of / wealth]


We accept the use of את ‘(nota acc.)’ in this context as real, uncorrupted Biblical Hebrew.


36. Data are for the 8,850 censored VLCs (§19.4.3). The next most frequent CICs are location (2,022x) and
possessor (788x).

[Table: prepositional-phrase subjects and subject complements in VLCs, by preposition. Surviving column heads: (inf. constr.), (miscellany). Surviving cells: 22 [c], 3 [f].]


a. Exod 1:14; Num 22:6; 35:6; Josh 22:17; Judg 9:29; 20:44, 46; 1 Kgs 7:45; Ezek 47:19; Zech 8:17; 
Qoh 4:3; 2 Chr 31:10, 17. 

b. Exod 13:2; Isa 26:4; Hos 13:9; Mal 1:10; Ezra 3:3. 

c. Exod 8:22; 9:28; Josh 24:15; 1 Sam 15:22; 2 Sam 13:16; 18:11; Jer 40:4; Amos 6:10; Ps 92:2; Prov 
21:9; Qoh 5:17; 7:2, 5; 11:7; Esth 5:8; 6:6; Ezra 4:3; Neh 13:13; 2 Chr 20:17; 25:9; 26:18; 29:10. 

d. Gen 47:26; Num 26:45; 2 Sam 16:2; Ps 69:23; Song 1:3; Qoh 9:4; 1 Chr 26:21. 

e. Gen 40:17; Exod 9:28; Num 26:4; Ezek 7:11(3x); 43:14; Neh 11:36; 13:28; 1 Chr 5:18; 9:28, 30, 
32; 12:30; 2 Chr 20:1; 34:13. 

f. 1 Kgs 3:18, Job 41:3, Prov 6:26. 

g. Num 35:6; Jer 23:33, 45:4; 1 Chr 2:9. 
h. Exod 18:4; Ps 118:7, 146:5; Job 23:13, 34:35, 37:10.

i. Exod 16:32; Lev 3:6; 11:39; Isa 26:8; Jer 3:23; Nah 1:7; Hab 1:11; Zech 4:7; Ps 4:3; 37:26; 69:23; 
Job 6:26; 13:12, 16; Lam 4:3; Dan 9:16; 1 Chr 9:23; 21:3, 12; 29:11; 2 Chr 23:4; 28:21; 29:32. 

j. Gen 17:12; Exod 2:6; 30:2; Num 17:5; Judg 19:12; 1 Kgs 7:34, 35; 8:41; 11:14; 20:41; Isa 41:24; 
44:11; Hab 1:13; Ps 45:14; Ruth 2:20; Esth 6:13; 1 Chr 8:40; 2 Chr 6:32. 

k. Prov 6:26. 




One of the 11 instances of a CIC within a VLC consisting of a prepositional phrase involving
ב ‘in’ has a ב of specification.³⁷ A few others involve beth essentiae.³⁸ The single instance of the
former is shown in phrase marker (19.8) from Exod 13:2.


(19.8) [Phrase marker, Exod 13:2. Glosses: the / man / and / the / beast / me / he]


In VLCs, the 22 instances of a subject CIC consisting of a prepositional phrase involving ל ‘to’
plus an infinitive construct are well documented.³⁹


37. Once, as per Williams, 100, who cites Exod 13:2.

38. The literature on this topic is sparse. GKC, 379, cites Ps 146:5 and Job 23:13 as examples but deprecates Isa 26:4
as “textually very uncertain.” Williams, 100, provides Exod 18:4 as one of two examples. Joüon/Muraoka, 458, assert
that the beth essentiae is “used to indicate the predicate” and cite Exod 18:4; Ps 118:7, and 146:5.

39. See, among others, Joüon/Muraoka, 401–2. Three of their examples are in our list (Josh 24:15; 1 Sam 15:22;
2 Sam 18:11). We classify the three examples not in our list (Gen 23:8, 31:29; Esth 4:2) as quasiverbal clauses.


There are 30 cases where ל ‘to’ appears with a sister other than an infinitive construct. The
significance of this configuration remains to be worked out.⁴⁰ Phrase marker (19.9) from Job 13:12
shows a case where parallel subject complements appear first without a ל and then with a ל:


(19.9) [Phrase marker, Job 13:12. Glosses: memorials / proverbs of / dust / to / answers of / clay / answers / you(m)]


The 34 cases of a subject CIC or subject complement CIC consisting of a prepositional phrase
involving some form of מן ‘from’ are instances of partitive-2 that make a semantically nominal
phrase.⁴¹ In a full representation (§9.2.2), the CICs would also have the SR of “resource” (rsrc).


Phrase marker (19.10) from 1 Chr 9:30 illustrates the partitioning phenomenon:


(19.10) [Phrase marker, 1 Chr 9:30. Glosses: and / from / sons of / the / priests / pharmacists of / the / mixture / to / the / balsams]


40. There is evidence that in the configuration under consideration the preposition ל saliently marks a subject
complement as the result of a process. See Job 37:10, Lam 4:3, and Dan 9:16, among others.


41. GKC, 382; Williams, 122-23; IBHS, 213-14; van der Merwe et al., 289; Joüon/Muraoka, 460. 
19.6 Operators and the Lead Zone


Although we do not put much store in the details of the role and reference grammar (RRG)
layered structure of the clause where verbless clauses are concerned, we do take seriously the idea that
certain constituents tend to occur, or perhaps are constrained to occur, in the lead zone or, as RRG
would have it, in the “precore slot.” Van Valin sees the lead zone as “the position in which question
words appear in languages in which they do not appear in situ . . . ; it is also the location in which
the fronted element in a sentence . . . appears.”⁴² We will show that the lead zone is the preponderant
locus of the operators in verbless structures. We use the operators identified in §9.3.4 but have set
aside, as before, clauses containing the infrequent operators: includer (11x) and restricter (2x) (see
the table in §19.4.1, p. 267).

19.6.1 Clause-Level גם ‘also’

After we remove clauses containing rare CICs, the residuum of verbless clauses (8,850x)
contains 40 instances of גם ‘also’. Thirty-five of these are clause initial, and 3 follow an interrogative in
the lead zone.⁴³ Two occur in sbj/susp – sbj cmp/gram – gam/gram – sbj/resum clauses.⁴⁴ Overall,
at least 95% of the clause-level instances of גם occur in the lead zone.

19.6.2 Modals
There are 46 modals, of which 45 are in the lead zone (98%).⁴⁵


19.6.3 Clause-Level Negatives

The reduced corpus of verbless clauses includes 349 negatives. Of these, 216 are clause initial,
and 91 follow other operators (88 follow a closed interrogative,⁴⁶ and 3 in Qoh 9:11 follow גם
‘also’). This leaves 42 instances that occur outside the lead zone. Hence, 88% of the negatives are in
the lead zone. The identities of the CICs preceding the 42 seeming-outlier negatives are interesting.


Predecessor Sequence                     Count / Cite
Subject                                  25 [a]
Suspended subject                        [b]
Possessor                                3 [c]
Location                                 Isa 63:9
Manner                                   Job 36:4
Comparison | Manner                      Prov 26:1
Suspended subject | Closed interg.       2 [d]
Subject                                  Jer 23:23
Time point                               Prov 19:2
Time point                               Zech 8:11
Time aim                                 1 Chr 15:13
Time origin                              2 Chr 30:26
[...] | Closed interg.                   [row assignment lost]


a. Nineteen of these involve VLCs having the form sbj/gram – neg/gram – sbj cmp/gram. We should consider
whether the negatives might be parsed as inverted modifiers of the constituents making up the subject complements.

b. Deut 11:10; 2 Sam 21:2; Prov 24:23, 28:21.

c. Isa 53:2 (2x); Job 18:19.

d. Gen 34:23; Jer 5:3.


42. Van Valin, Exploring, 5.

43. Follow an open interrogative: 1 Kgs 14:14 and Mal 1:10; follows a closed interrogative: Esth 7:8.

44. Prov 17:15, 20:10. Both instances of גם might be identified as inverted modifiers of the following שניהם ‘two
of them’ and hence phrasal operators.

45. The odd locus is in Job 12:2, where a manner CIC (“truly”) precedes the modal (“surely”).

46. For the distinction between closed and open interrogatives, see §9.3.5.2.

If we declare suspended subjects to inhabit the lead zone, declare the various time margins to
be fronted in the lead zone, and absorb the 19 negatives positioned between subjects and subject
complements into the subject complements, then 97% of the negatives are in the lead zone.


19.6.4 Closed Interrogatives 

There are 206 closed interrogatives in the reduced set of verbless clauses. All but four are clause 
initial (98%). Two are preceded by a suspended subject (Gen 34:23 and Jer 5:3), one by a move- 
ment origin (2 Sam 23:19), and one is between a subject and a subject complement (Jer 31:20). 
19.6.5 Overall Results 

Our results show that operators do gravitate strongly to the lead zones of verbless clauses. 


Operator                Percent in Lead Zone
Modal                   98%
Negative                88% / 97%
Closed Interrogative    98%

19.7 Embedding and Verbless Clauses 


In this section, we examine how the embeddedness status of a verbless structure relates to its 
organization. We distinguish 9 kinds of embeddedness by their parent identity. Their incidence 
across the reduced set of 8,850 verbless clauses in Biblical Hebrew is given in the count-column in 
the table. (The final two rows report statistics for rare parental entities and will not be discussed.) 


Identity of Parent                      [count column not preserved]
All Censored VLCs
No parent (matrix structure)
Sentence
Nominalized structure
Discourse unit with cue phrase
Object of address
Discourse unit without cue phrase
Paradoxical discourse unit
Miscellany


Three of the identity labels include the phrase “discourse unit,” a topic that we do not take up until 
chap. 21. For our present purposes, suffice it to (1) remark that the non-paradoxical discourse units 
tend to dominate subordinated clauses, and (2) provide an example of each. Phrase marker (19.11) 
from Mal 3:6 shows a reason discourse unit with the cue phrase "3 ‘because’: 
(19.11) Had ————— nana > ^3 because 
"V cl sbj 
oblq ET gram — 4X8 | 
gei mm Yahweh 


Phrase marker (19.12) from Num 15:31 shows a reason discourse unit that lacks a cue phrase. 


(19.12) Pa — Re — pi —— ja —— "Np iniquity 
no her 
loc ___, in+pron ^ in 
gram prep 
:o her 


Among the frequently occurring parents, nominalized clauses attract attention. Compared with 
the other frequently occurring parents, they have a low average CIC count (1.3), are much less 
likely to contain an overt subject (23% versus the overall average of 80%), and are unlikely to have 
an operator (1.7%). Phrase marker (19.13) from 1 Kgs 1:8 shows a complex noun phrase containing 
a nominalized clause: 


(19.13) n:humn n:humn ^ the 
app def L 
D"3i33 heroes 
on SUN which 
E cl poss to * humn ^ to 


d gram prop nro A | 
TM David 


The form 717? ‘to David’ is a prepositional phrase, and calling it a possessor CIC is reasonable. 
Having the possessor CIC as the sole clause immediate constituent may seem odd, but it is 
eminently justifiable, as we will show in §19.9.6. 

Were there no nominalizer ?ÜN ‘which’, the prepositional phrase likely would have been com- 
bined directly with 0°91239 ‘the heroes’ and would have been licensed by “modification.” Phrase 
marker (19.14) from 1 Chr 18:2 illustrates a similar situation: 


(19.14) n:humn 


mod 


D'T39 servants 
N to + humn ^ to 


prep 
i" TT David 


Since nominalization is forced by the presence of Wx, and since we have declared that it is 
a clause that is nominalized by Wx, we have been constrained to create a sizable set of one-CIC 
verbless clauses. There are 978 of these one-CIC nominalized clauses; 65% of them consist of 
location CICs, 16% of possessor CICs, 9% of accompanier CICs, and 10% other CICs. 


19.8 Indeterminacy in CIC Enumeration 


When dealing with verbless clauses, one finds complicating factors. The nature and number of 
CICs making up a verbless clause are often uncertain. This is because syntactic ambiguity is ever 
with us. To illustrate what we mean by this, consider the following pair of phenomena. 


19.8.1 Combine Constituents or Not? 


Consider phrase marker (19.15) from Jer 1:1, where we have dominated the two in + geog / prep 
constituents with an in + geog / superset node. In this representation, the embedded clause is a 
one-CIC (location) verbless clause: 


(19.15) nnum nhumn > y the 
app N 
by Dun3 priests 
nom Wks which 
nom A 
~ cl loc in + geog in + geog "4 in 
oblq gram superset prep BR : 
miny Anatot 
in * geog "à in 


Pae n M ——— 
n:geog Y 

cstr a PON  landof 

:-293 Binyamin 


In an earlier version of our database, the embedded clause consisted of two location CICs. Given 
our repertoire of licensing relations, the combining of the original pair of constituents was proper. 
However—and this is the critical point—whether constituents should be combined is not always so 
clear. Syntactic ambiguity is common. Consider phrase marker (19.16) from Song 2:14: 


cl sbj n:anml , > 
(19.16) oblq gram sup O0 TI cove 
X me 
loc in * bldg ‘a i 
gram prep n 
ir "un refuges of 


bu n:sbst " the 


loc in * abst 
— 
gram prep 


nano secret of 
n n:geog , 
def >‘) the 


mT cliff 
Is this a three-CIC verbless clause, or should the location constituents be combined, giving a 
two-CIC verbless clause? The answers involve interpretive, exegetical decisions. 


19.8.2 Absorb Constituents or Not? 

Or consider phrase marker (19.17) from Ezek 32:2. Here the debate is whether “in the seas” 
should be a CIC that locates the subject “in the seas,” as shown, or better, should modify “the 
dragon.” Opinions may differ in cases such as this. 


(19.17) cl dl and i 
oblq N st in cl } and 
sbj - 
gram nnwN  thou(m) 
cmpr like + anml ^3 like 
gram prep 
n:anml 
def the 
oun dragon (V) 
loc in * geog "01 
gram prep scm 
n:geog "es 
dé C55 the 
ov" seas 


Because of these phenomena, the grouping of verbless clauses by number of CICs is not precise. 


19.9 One-CIC Verbless Structures and Clauses 


19.9.1 Impermanents and Isolates in One-CIC Structures 
As documented in §19.4, we set aside structures⁴⁷ that consist solely of single, rare CICs or 
single syntactic isolates, which leaves us with 1,466 one-CIC verbless structures to consider. 


19.9.1.1 Impermanent CICs 

There are 8 instances where we cannot determine the status of a one-CIC structure. One is a 
lapsus calami, a one-segment utterance in Gen 30:11. The other 7 CICs are labeled nebulous: 1 Kgs 
8:31, Isa 8:6, Jer 8:15 (2x), Mic 6:12, Ps 68:14, and Qoh 5:6. 


19.9.1.2 Syntactic Isolate CICs 

Syntactic Isolate CICs are numerous. In §9.3.2, we provided arguments that syntactic isolates 
are not syntactic constituents of clauses. For Biblical Hebrew, we nominated three sorts of CIC for 
membership in this CIC subtype: exclamative, label, and vocative. 

Exclamatives. There are 110 exclamatives among the one-CIC structures. Of these, 37 involve 
an oath formula. Phrase marker (19.18) provides an example from 2 Sam 22:47: 


(19.18) cl — , excl — , n:med ^n 


oblq gram cstr life of 


nmm Yahweh 


47. We write “structures” rather than “clauses” in this subsection because the items under discussion have little 
claim to be clauses. But see §19.9.6, where one-CIC covert-subject verbless clauses are taken up. 
Labels. There are 201 labels. Consider, for example, phrase marker (19.19) from Isa 15:1: 
(19.19) cl — , lb ___ nthing 


oblq gram cstr ^ 7 NWN burden of 


Axim Moab 


This lbl constituent is really only a label, a title actually. That the CIC is dominated by a clause node 
reflects our hesitancy to introduce a non-clausal node for these structures. 

Vocatives. Our repertoire of one-CIC structures includes 33 vocatives. Eight are in oracles, 
and 25 are in speeches. Phrase marker (19.20) from Hos 2:3 provides an example, a one-CIC 
speech. 


(19.20) cl vb 
oblq NC gram WAX — you(m) say 
ind obj to * humn s 
gun 7 5 to 


prep Enc L 
n:humn T 

suff ———^ "DN brothers 

is D3 you(m) 


obj addr cl — — , vocv — , nihumn people 


gram oblq gram suff "y 


O me 


19.9.2 Operators in One-CIC Structures 

Of the 6 operator CICs presented in §9.3.4, only 1 appears in one-CIC structures: negative 
(41x). The negatives are in speeches/oracles or in discourse units. Phrase marker (19.21) from 
Num 22:30 shows a negative-in-a-speech: 


(19.21) cl dl seq w ; 
oblq Ne gram ) and 
vb ; : 
gram ax’ he said 
obj addr cl neg ad 
gram oblq gram :N not 


Phrase marker (19.22) from 1 Sam 2:16 shows a negative in a one-CIC condition: 


(19.22) UNit MEME , 
cue | >") and 
Ww sent — cod O O ü O - 
discrs ^ ^ cue . > ON if 
^ ht 
\ Mod neg 
` —— 
x oblq gram N? not 
Nost vb sand I took 
no cue oblq gram UTE 
S instr in * abst "^ in 
gram prep 


b inp strength 
19.9.3 The Influence of Context on One-CIC Clause Makeup 

We do not engage in discourse analysis at this stage, but we will informally 
examine the incidence of the GFs and SRs in terms of “the grammatical function of the clause in 
its linguistic context [with f]our relationships distinguished: independent, coordinate, subordinate, 
nominalized.”⁴⁸ We identify the final three relationships on the basis of any preposed cue phrase.⁴⁹ 
When neither a cue phrase nor any other identifying clue is present, the relation of independence 
is posited. Modifying Andersen, we abbreviate the four relationships as Asyndetic (“unconnected”), 
Syndetic (“connected”), Subord, and Nomin, respectively.⁵⁰ 

We group the SRs in one-CIC VLCs into the six groups detailed inside the back cover. 


Six Semantic Role Groups 
PR | Other Participant | MN | Manner 
SP | Movement and Spatial | EN | Enriching Constituent 
TM | Temporal | DU | Mixed-Level and Phrasal Discourse Units* 

* Since our focus is on the verbless clause, we have assigned “time point” to the 
temporal superset. 

Using the GFs and SR groups and the four context relationships, we tabulate CIC incidence in one- 
CIC verbless clauses. 


operator 
PR 


Patterns emerge more clearly when we convert the columns into percentages. 


48. Andersen, The Hebrew Verbless Clause, 25. 
49. On cue phrases, see §8.2.1, §8.3, §9.3.1.1, and §21.3.2. A cue phrase is part of a discourse, not part of a clause. 
50. Andersen, The Hebrew Verbless Clause, 109. 




Asyndetic | Syndetic | Subord Nomin All 
sbj 29.1% 29.2% 16.1% 0.9% 9.2% 
dir obj 
ind obj 
cmp 


4.2% 1.7% — 0.7% 
— — — 0.1% 
8.3% 33.9% 1.0% 8.1% 


operator 11.1% 4.2% 10.2% 3.5% 
PR 6.7% 4.2% 19.4% 
SP 4.1% 4.2% 5.9% 69.2% 

TM 8.7% 8.3% 4.2% 1.5% 

MN 4.1% — 3.4% = 

EN 6.7% 12.5% 8.5% 0.3% 2.7% 
DU 6.7% 20.8% 11.9% 1.1% 3.6% 


The table enables us to refine our observations regarding one-CIC VLCs. For example, almost half 
of the CICs making up one-CIC VLCs have spatial SRs (far right column), but it is far more 
revealing to observe that, while spatial SRs account for approximately 5% each of the asyndetic, 
syndetic, and subordinated one-CIC VLCs, they account for more than two-thirds of the nominalized 
one-CIC VLCs. Or note that, while subject complements (cmp⁵¹) are fairly common in subordinated 
clauses (33.9%), they are very rare in nominalized clauses (1.0%). 

When the sole predicative CIC is a spatial or temporal CIC—as obtains in 51% of the tabulated 
cases—then Van Valin’s notion (quoted in §19.2) that “the primary constituent units of the clause 
[include] . . . a ‘periphery’, which subsumes non-arguments of the predicate, e.g., setting locative 
and temporal phrases,”⁵² seems inappropriate. These CICs are the only explicit predicative 
constituents and hence can hardly be considered “peripheral.” Surely they are nuclei. 


19.9.4 Grammatical Functions in One-CIC Clauses 

Our repertoire of GFs was presented in §9.3.5.1. We next discuss each with regard to one-CIC 
VLCs. 
19.9.4.1 Objects 

There are only 10 direct objects.⁵³ Six are in answers to questions (Deut 10:12, 20:11; Jer 24:3; 
Amos 7:8, 8:2; Ps 120:4). Phrase marker (19.23) from Amos 7:8 supplies the elliptical answer “[I 
see] tin” to the question diy AX nns ‘What are you seeing, Amos?’ 


(19.23) cl dl seq w ^ and 
oblq N gram : 
vb A : 
gram AX | said 
obj addr cl dir obj : 
gram oblq gram ^ Wh tin 


The remaining 4 instances of a sole direct object are not as clear-cut: 


51. One of the complements is not a subject complement, the cognitive complement (!?) in Neh 10:31. 
52. Van Valin, Exploring, 4. 
53. Additionally, among the one-CIC VLCs, we find no indirect objects and no distributive indirect objects. 
* Gen 49:25. The constituent is marked with an object marker, so we hesitate to force it to 
be the subject of the following clause, contra NJPS. 

* Num 21:14. On this verse, the NJPS has this footnote: “The quotation that follows is a 
fragment; text and meaning are uncertain.” We agree. 

* 1 Sam 19:3. The “what” that will be seen. 

* Ps 136:23. A baffler . . . 


19.9.4.2 Subjects 

Among the one-CIC VLCs, we find no subjects-of-address, no distributive subjects, and no 
Janus sbj-cmp/sbj constituents. We do find 135 subject CICs. These are a motley lot. Just over 
one-third (53x) are in speeches, and just over one-third (51x) are fragments that we called “subjects” 
(by default) rather than “fragments” or “nebulous,” leaving a residue of about one-fifth (31x) of 
miscellaneous wannabe clauses. 

Phrase marker (19.24) from 2 Kgs 9:19 shows speech (within a speech) consisting of a single 
subject CIC: 


(19.24) cl — , diseqw ^ and 


oblq N gram 
vb SNN’ he said 


gram 
obj addr cl mop 
gram ^ - oblq Ne gram m5 thus 
N i "DN he said 
sbj n:humn ; 
gram — — — — — — —* det ———7 the 
T?2 king 
obj addr cl sbj n 
gram oblq gram Di») peace 


19.9.5 Subject Complements and Semantic Roles in One-CIC Clauses 

When we remove from further consideration all of the CICs discussed in §19.9.1-§19.9.2 and 
the subjects and objects of §19.9.4, we are left with 1,266 one-CIC VLCs that overtly consist only 
of subject complements⁵⁴ or predicates with SRs already assigned (loc, poss, and so on). For the 
census of these CICs, see the table: 


Clause Immediate Constituent | Count || Clause Immediate Constituent | Count 
Location | 653 || Aim/purpose | 29 
Possessor | 167 || Beneficiary | 20 
Subject complement | || Movement origin | 20 
Accompanier | || Comparison | 19 
Time point | || Other | 118 


54. Predicates, the semantic roles of which have yet to be assigned. 
The first three CIC types account for about two-thirds of the total of all one-CIC verbless clauses.⁵⁵ 
These predicate CICs populate the VLC nuclei. No subject CICs overtly combine with the nuclei to 
form fully fledged clauses. But, as we will see in the next section, there are covert subjects nearby 
or inferable. These one-CIC VLCs actually involve two CICs, a predicate and a covert subject. 


19.9.6 Covert-Subject Verbless Clauses 


We begin with a concrete example of the phenomenon that we are about to investigate. Our 
database includes many phrase markers such as the construction from Ruth 3:15 seen in (19.25). 
Considered by itself, the construction is impossible to make sense of. What is “upon thee”? 


(19.25) nom 7W)N which 
nom VN 
N cl loc upon + pron ^ 
oblq gram prep 29 UPORI 
To thee(f) 


Here we have labeled a location-CIC as being a clause. That the actual subject of the clause is 
nearby can be seen by examining the nominalized construction in its larger context. The noun 
phrase of which it is a part has phrase marker (19.26). It is (the) cloak that is upon thee. The covert 
subject of the embedded clause is “cloak.” 


(19.26) n:garm nn50m cloak 
app i 
nom “W8 which 
nom a 
Ns cl loc  , upon + pron ^ , nby upon 


oblq~ ' gram prep 


Bom (T?  thee(f) 


This sort of behavior is common. When we examine all 9,500 verbless clauses, we find that 2,224 
lack a subject (23.4%). That is, almost a quarter of the structures that we have classified as 
verbless clauses lack an overt subject. Among the 1,466 one-CIC VLCs, 1,327 lack subjects (91%). But 
these verbless clauses do have a covert or linked subject, as the notion of control describes. 


19.9.6.1 Kinds of Control in Biblical Hebrew 
According to Jaworska: 


There is a range of subject-predicate constructions where the subject appears to be missing. Yet, it is 
quite easy to interpret the missing element correctly as either coreferential to some other noun phrase 
(NP) within the same sentence or as having arbitrary reference.⁵⁶ 


55. As predicate, almost 52% of the clauses have a location CIC, 13% have a possessor CIC, and 9% have a subject 
complement. The overwhelming majority of the location CICs are in nominalized clauses (642/655 or 98%). 

56. E. Jaworska, “Control,” in Concise Encyclopedia of Grammatical Categories (ed. Keith Brown and Jim Miller; 
Amsterdam: Elsevier, 1999) 107. For a presentation of the two main theories of control, see Richard Hudson, “Con PRO, 
or the virtues of sharing,” University College London Working Papers in Linguistics 7 (1995) 277-96. 
Note that the missing subject of the embedded clause is usually supplied by an NP in the 
matrix clause or in some prior clause.⁵⁷ This noun phrase is called the controller. 
In Biblical Hebrew, we have found these four common kinds of control: 


* Head control. The covert subject is the head of the embedding noun phrase. In (19.27) 
from 1 Chr 22:2, we see head control, a very common phenomenon. The n:humn/app noun 
phrase head, màn ‘the sojourners’, is the subject of the embedded nominalized clause. 


(19.27) n:humn n:humn , 
app def J the 
Bc" D'à  sojourners 
nom , . 
nom SWN which 
\ cl loc in + geog PET 
oblq gram prep 3 in 
n:geog 


— > pin land of 


ud in . 
Oxi? Israel 


* Discourse control. The covert subject is found in the preceding discourse. Phrase marker 
(19.28) (see p. 282) from 2 Chr 20:31 shows discourse control. The subject of the first 
clause is the covert subject of the second clause.⁵⁸ 

* Object control. The covert subject is the object in the matrix clause. Phrase marker 
(19.29) (see p. 282) from Gen 1:4 shows object control. “The light” is the direct object in 
the matrix clause and the subject in the embedded clause. 

* Arbitrary control. The covert subject must be inferred. Phrase marker (19.30) (p. 283) 
from Gen 7:23 illustrates arbitrary control. The covert subject of the nominalized clause is 
supplied by the NJPS: “Noah . . . and those with him in the ark.” 


The foregoing are the main kinds of control but are not the only kinds. 
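
The four kinds of control listed above can be pictured as a simple lookup. The following is only a minimal sketch under stated assumptions: the class `VerblessClause`, its field names, and the order in which candidate controllers are tried are hypothetical illustrations and do not describe how our database stores or resolves covert subjects.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional, Tuple


class Control(Enum):
    HEAD = "head"
    OBJECT = "object"
    DISCOURSE = "discourse"
    ARBITRARY = "arbitrary"


@dataclass
class VerblessClause:
    # All field names are hypothetical; they stand for information that a
    # phrase marker and its context would have to supply.
    overt_subject: Optional[str] = None
    embedding_np_head: Optional[str] = None  # head of the embedding noun phrase
    matrix_object: Optional[str] = None      # object of the matrix clause
    prior_subject: Optional[str] = None      # subject found in preceding discourse


def find_controller(clause: VerblessClause) -> Optional[Tuple[Control, Optional[str]]]:
    """Return (kind of control, controller), or None if the subject is overt.

    The order of the checks is an assumption made for this sketch.
    """
    if clause.overt_subject is not None:
        return None
    if clause.embedding_np_head is not None:
        return (Control.HEAD, clause.embedding_np_head)
    if clause.matrix_object is not None:
        return (Control.OBJECT, clause.matrix_object)
    if clause.prior_subject is not None:
        return (Control.DISCOURSE, clause.prior_subject)
    # Nothing explicit is available, so the subject must be inferred.
    return (Control.ARBITRARY, None)


# Glosses taken from the examples discussed above.
print(find_controller(VerblessClause(embedding_np_head="the sojourners")))  # 1 Chr 22:2
print(find_controller(VerblessClause(matrix_object="the light")))           # Gen 1:4
print(find_controller(VerblessClause()))                                    # Gen 7:23
```

A clause with an overt subject returns `None`, since control is at issue only when the subject is covert.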

We have seen that the one-CIC VLCs are not somehow deficient but, rather, exhibit the 
phenomenon of control. Hence, we augment our glossary definition of clause by adding “possibly 
covert” to yield: 


clause — Typically, a syntactic unit that includes a predicator and the clause immediate 
constituents that accompany it. In Biblical Hebrew, a possibly covert subject plus some other 
constituent(s) can constitute a clause that has no predicator (a “verbless” or “nominal” clause). 


57. See also D. Crystal, A Dictionary of Linguistics and Phonetics, 107; P. H. Matthews, Oxford Concise 
Dictionary (Oxford: Oxford University Press, 2005) 74; R. L. Trask, A Dictionary of Grammatical Terms 
(London: Routledge, 1993) 62. 

58. We do not call this “subject control” because the second clause is not embedded in the first clause. 
(19.28) 
obl arid pgs 3 and 
\ he reigned 
us —— vovin Yehoshapat 
0 n 
Cmm Judah 
RA — pod NI "]3 son of 
= unionidis —À UV» thirty 
bu ^ and 
wan five 
"JV year 
€ jc pen '3?2 reign 
fe — — — i his/him 
(19.29) x TT "^ and 
den NT he saw 
E. DDR God 
pr obi obj pid sbst TN [nota acc.] 
oM md the 
b^ nix light 
gd ^73 that 
cl sbj cmp — zv good 


oblq 


gram 
(19.30) cl — ,diseqw 
oblq 


^"^ and 


"NV* he remained 


gram 
ad d ew mod “IN only 
unione ni Noah 
M es "^ and 
on WR which 


nom 
\ cl accmp wih *pron — 3 — —  , 'hN with 


obiq gram prep 
m f Mafhim 


loc in * furn 
— 


gram prep * in 


ig n:furn 'o the 


def 


san ark 


19.10 Two-CIC Verbless Clauses 


When we shunt aside 100 two-CIC VLCs that involve a Janus subject complement-subject,⁵⁹ 
set aside all clauses that contain any “rare CIC,” and omit empty clauses that result when syntactic 
isolates are ignored, then we are left with 5,320 clauses for study. 

19.10.1 Operators in Two-CIC Clauses 

Only two of the operators listed in §9.3.4 occur in significant numbers: negative (57x) and 
closed interrogative (29x). All but five of the negatives are clause initial.⁶⁰ The closed 
interrogatives are all clause initial. Hence 94% of the operators in two-CIC VLCs are clause initial. As is 
the pattern across all VLCs, operators in two-CIC VLCs strongly tend to appear at the beginning 
of the clauses. 
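
The 94% figure follows directly from the counts just given. As a small worked check (the variable names are ours, chosen only for illustration):

```python
# Operator counts in two-CIC VLCs as reported in this section.
operators = {
    "negative": {"total": 57, "not_initial": 5},
    "closed interrogative": {"total": 29, "not_initial": 0},
}

total = sum(op["total"] for op in operators.values())
initial = sum(op["total"] - op["not_initial"] for op in operators.values())
share_initial = 100 * initial / total

print(f"{initial} of {total} operators are clause initial ({share_initial:.0f}%)")
```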

19.10.2 The Influence of Discourse Context on Two-CIC Clause Makeup 


19.10.2.1 Two-CIC VLC Contexts 
As with one-CIC VLCs, we identify the four clause contexts exemplified below. 


1. Asyndetic. This context type consists of seemingly freestanding clauses, as in phrase 
marker (19.31) from Song 1:15: 


(19.31) cl sbj n:body prt : 
oblq gram suff Te eyes 
N TO thee(f) 
sbj cmp . i, 
gram :Du» doves 


59. These verbless sentences are addressed in appendix 6. 
60. The five clause-final negatives are in Gen 24:21, 27:21, 37:32; Judg 14:15; and 2 Kgs 20:19. Each is preceded 
by a closed interrogative. 
None of the freestanding clauses is shown as part of some larger (discourse) unit only because 
we have not yet advanced much into discourse relations (see chap. 21). 
2. Syndetic. Phrase marker (19.32) from Ps 73:22 shows this type of context:⁶¹ 


(19.32) cl dl and ; 
oblq ap ) and 
sbj " 
gram Ww | 
sbj cmp 


gram — yl stupid 


3. Subordinated. Phrase marker (19.33) from Lev 5:11 is illustrative: 


eed eH inscadesresenacsersccssesseeses= +> because 
^ pe I pha —> NRUN sin offering 
sbj 


"NY 
gram ———;N'"n she 


4. Nominalized. Phrase marker (19.34) from Gen 1:11 shows nominalization: 


(19.34) dir obj n:veg n:veg "* fruit 
gram app mod 


S to + abst 
prep ee -— D 
- 'TD kind 
mw i his/him 


nom . 
nom T WN which 
cl sbj n:veg , 
oblq gram suff yt seed 
"| his/him 
loc in * pron "^ in 


gram prep 
D i his/him 
19.10.2.2 Two-CIC VLC Supersets 


We group the SRs in two-CIC VLCs into the six groups detailed inside the back cover. 


Semantic Roles 
PR | Other Participant | MN | Manner 
SP | Movement and Spatial | EN | Enriching Constituent 
TM | Temporal | DU | Mixed-Level and Phrasal Discourse Units(a) 

a. Since our focus is on the verbless clause, we have assigned “time point” to the 
temporal superset. 


61. At first glance, this VLC has three CICs. But the coordinating conjunction is impermanent. It awaits promotion 
into discourse. It connects the clause with its prior context. 
19.10.2.3 Incidence as a Function of Context 
We first look at simple CIC incidence as a function of context. We use the GFs and six SR 
groups along with the four context relationships introduced in §19.9.3. 


sbj | 47.1% | 47.1% | 48.9% | 43.3% | 47.6% 
dir obj 0.1% - 0.1% - 
ind obj | 0.0% | | - | - | 0.0% 
cmp 34.0% 27.3% 36.0% 24.9% 33.2% 
operator 0.8% 1.2% 0.5% 1.3% 0.8% 
PR 4.6% 6.9% 5.3% 
SP 5.4% 22.2% 8.1% 
TM 2.9% - 1.2% 
MN 0.2% 0.2% 0.2% 0.3% 
EN 1.7% 0.5% 0.2% 0.8% 
DU 2.5% 0.9% 1.0% 2.5% 


The table enables us to make some observations regarding two-CIC VLCs: (1) Almost half of the 
CICs in two-CIC VLCs are subjects. Hence, an overt subject is present in most of the two-CIC 
VLCs.⁶² (2) A subject complement represents about one-third of the CIC total.⁶³ (3) Spatial SRs in 


62. There are 5,320 two-CIC VLCs (and hence 10,640 CICs). There are 5,061 subjects. Hence, 95% of the two-CIC 
VLCs contain overt subjects. It might be instructive to examine the 40 nominalized clauses lacking overt subjects 
(297 - 257 = 40) to ascertain whether covert subjects are in their picture. 

63. One of the complements is not a subject complement, the cognitive complement (!?) in 2 Chr 10:16. 
Nomin clauses are two times more common than in Syndetic clauses and 3½ times more common 
than in Asyndetic or Subord clauses. 
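
The subject figures cited for two-CIC VLCs rest on two different denominators, the number of CICs and the number of clauses. The short check below makes the difference explicit. It assumes, as the reported totals imply, that no clause contains more than one subject CIC; the variable names are ours.

```python
# Counts reported for the two-CIC verbless clauses (VLCs).
two_cic_clauses = 5_320
subjects = 5_061
nomin_clauses = 297
nomin_subjects = 257

total_cics = 2 * two_cic_clauses                      # each clause has two CICs
share_of_cics = 100 * subjects / total_cics           # subjects among all CICs
share_of_clauses = 100 * subjects / two_cic_clauses   # clauses with an overt subject
nomin_share_of_cics = 100 * nomin_subjects / (2 * nomin_clauses)
nomin_without_subject = nomin_clauses - nomin_subjects

print(f"CICs in two-CIC VLCs: {total_cics}")
print(f"subjects as share of CICs: {share_of_cics:.1f}%")
print(f"clauses with overt subject: {share_of_clauses:.0f}%")
print(f"subjects as share of CICs in Nomin clauses: {nomin_share_of_cics:.1f}%")
print(f"Nomin clauses lacking an overt subject: {nomin_without_subject}")
```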


19.10.2.4 Sequence as a Function of Context 

Given 4 types of contexts and the 11 CIC groups, there are roughly 500 possible sequence pat- 
terns to keep track of in two-CIC VLCs. In practice, however, only 66 patterns are realized, and 
many of these are realized feebly. Rather than document all of the patterns, we present in table form 
only the 3 most common patterns for each context. 


| Asyndetic | Syndetic | Subord | Nomin 
Rank #1 | sbj+cmp | 1,533 | 41% || sbj+cmp | | || cmp+sbj | 357 | 45% || sbj+cmp | | 
Rank #2 | cmp+sbj | 929 | 25% || cmp+sbj | 82 | 17% || sbj+cmp | 211 | 26% || sbj+SP | | 
Rank #3 | sbj+SP | 322 | 9% || sbj+SP | 67 | 14% || sbj+SP | 56 | 7% || cmp+sbj | | 13% 
Fraction included | 75% | 67% | 78% | 65% 

(1) The top three sequences account for around 70% of the clauses in each context type. (2) Unlike 
the situation in the other contexts, subordinated two-CIC VLCs prefer the cmp+sbj order. (3) Note 
that “setting locatives” (SP) fairly often are the sole predicative constituents and hence constitute 
their clause’s nucleus rather than periphery. 
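
The figure of “roughly 500” possible patterns and the ranking of patterns within a context can both be illustrated in a few lines. This is a sketch only: the group labels follow the row labels of our tables, while the function name `top_patterns` and the toy list of clauses are invented for illustration and are not drawn from the database.

```python
from collections import Counter
from itertools import product

CONTEXTS = ["Asyndetic", "Syndetic", "Subord", "Nomin"]
# Eleven CIC groups: the GF and operator rows plus the six SR groups.
CIC_GROUPS = ["sbj", "dir obj", "ind obj", "cmp", "operator",
              "PR", "SP", "TM", "MN", "EN", "DU"]

# Every ordered pair of CIC groups in every context.
possible = list(product(CONTEXTS, CIC_GROUPS, CIC_GROUPS))
print(len(possible))  # 4 * 11 * 11


def top_patterns(clauses, n=3):
    """Rank two-CIC sequences by frequency; return (pattern, count, share)."""
    counts = Counter(clauses)
    total = sum(counts.values())
    return [(pattern, count, count / total)
            for pattern, count in counts.most_common(n)]


# Toy data (hypothetical), one tuple per clause in a single context.
toy = ([("sbj", "cmp")] * 4 + [("cmp", "sbj")] * 3
       + [("sbj", "SP")] * 2 + [("SP", "sbj")])
ranked = top_patterns(toy)
coverage = sum(share for _, _, share in ranked)
for pattern, count, share in ranked:
    print("+".join(pattern), count, f"{share:.0%}")
print(f"fraction included: {coverage:.0%}")
```

The “fraction included” value is simply the summed share of the three most frequent patterns, which is how the last row of the table above is to be read.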


19.10.3 Grammatical Functions in Two-CIC Verbless Clauses 


19.10.3.1 Objects and Complements 

There are only 6 direct objects. In 4 cases, a verb may have dropped out: Exod 32:29, 
2 Sam 23:17, 2 Kgs 4:14, and Isa 66:18. One is in an answer to a question, 1 Sam 6:4. One is very 
strange: Deut 11:2. 

There are 3 indirect objects. Twice, they are contained in questions (Num 14:27 and Jonah 1:6). 
Once, ellipsis is involved (Hos 3:3). 

With regard to complements, there are 3,530 subject complements and 1 cognitive complement 
(2 Chr 10:16). 


19.10.3.2 Subjects 

There are 259 two-CIC VLCs with no overt subject indicated: 173 asyndetic, 28 syndetic, 18 
subordinated, and 40 nominalized. When there is no overt subject in a two-CIC VLC, the covert 
subject may be: 

1. Quite nearby. This is the case for 33 of the 40 nominalized two-CIC VLCs lacking overt 
subjects. Consider, for example, phrase marker (19.35) from Zech 14:4 (see p. 287). Here, 
the subject of the nominalized clause is just the head of the apposition phrase. 

2. Remote from the clause under study. Consider phrase marker (19.36) from Isa 53:3 (see 
p. 287). In searching for an explicit constituent that can serve as the subject of this clause, 
we are forced to go back five verses, to Isa 52:13, where we encounter "739 ‘my servant’. 

3. Not a simple constituent. Consider phrase marker (19.37) from Gen 48:18 (see p. 287). 
Here, the subject might be said to be the action of Jacob's putting his hand upon Ephraim's 
head. 
(19.35) n:geog  , n:geog : 
app cstr EM oe s7 mountain of 
n:veg , 
def EM n the 
ont olives 
nom 
nom a WN which 
cl loc upon + body prt as 
obiq gram prep by upon 
NG n:body prt 
cst iin 38 face of 
D»v1v Jerusalem 
mvt orig from + sptl , 
gram prep i NEN D from 
DTI? east 
(19.36) cl di and . 
oblq 1st in cl | and 
bu cmpr like + ptcp ^) like 
gram prep : 
nom ptc nom ptc cstr ptc LE 
cstr oblq gram "men hiding of 
dir obj 
gram 039 face 
mvt orig from * pron : 
gram — prep ii from 
330 — his/him 
sbj cmp : 
gram ma despised one 
(19.37) cl neg À 
— 
oblq SUN gram Nd not 
mnr 
Ý 
gram 12 thus 
voctv n:humn , 
gram suf ——— X father 


T ' 0 me 


19.11  Three-CIC Verbless Clauses 


Together, the one-CIC and two-CIC VLCs account for over three-quarters of the verbless clause 
stock (78%). The three-CIC VLCs account for nearly another one-fifth (17%). After carrying out 
our usual reduction procedures, we have 1,534 three-CIC verbless clauses. These we subdivide into 
four major types: 


1. Subject and subject complement (sbj and sbj cmp) plus one different CIC (56%) 
2. Subject (sbj) plus two non-sbj cmp CICs (29%) 
3. Three non-subject (non-sbj) CICs (5%) 
4. Suspended and resumed subject (or distributive subject) plus one CIC (10%). 
Before examining types 1-3, we separate off the type 4 clauses, since they overlap with other types. 


19.11.1 Subject and Subject Complement plus One CIC 


The following table documents how three-fifths of the three-CIC VLCs (61%) consist of a sbj 
CIC and a sbj cmp CIC, plus one additional CIC. 


CIC Sequence | Incidence || Most Frequent “???” CIC 
sbj | sbj cmp | ??? 263 || loc | 80 | benf | 36 | refrnt | 31 
sbj cmp | sbj | ??? 275 || loc | 69 | tm pt | 43 | benf | 28 
??? | sbj | sbj cmp 88 || intrg | 15 | gam | 15 | loc | 13 
??? | sbj cmp | sbj 91 || intrg | 28 | neg | 25 | mdl | 10 
sbj | ??? | sbj cmp 95 || loc | 21 | neg | 20 | poss | 13 
 || loc | 11 | benf | 10 | comp | 10 


19.11.2 Subject plus Two CICs 


Approximately one-third of the three-CIC VLCs have a sbj CIC plus two more CICs, neither 
being a sbj cmp. 


CIC Sequence | Incidence || Most Frequent “??? + ???” CIC Pairs 
sbj | ??? | ??? 168 || loc + aim | 12 | loc + loc | 10 | loc + poss | 
??? | sbj | ??? 170 || neg + loc | 17 | neg + poss | | intrg + poss | 
??? | ??? | sbj || neg + poss | 9 | cmpr + mnr | | intrg + neg | 
Total | 440 


19.11.3 Subject-less VLC: Three-CIC No-Subject VLCs 

Of the 1,534 three-CIC verbless clauses, 74 (5%) lack a subject. There are few repeated 
patterns. Seventy-seven patterns are hapax, 14 are dux, 7 are tris, 1 occurs 4 times, 1 occurs 6 times 
(all in Numbers 29),⁶⁴ and 1 occurs 12 times (all in Psalms).⁶⁵ 
19.11.4 Suspended and Then Resumed Subject plus One Different CIC 

The sbj-susp sbj-resum variant accounts for about 10% of the three-CIC VLCs. 


CIC Sequence Incidence Most Frequent “???” CIC 
sbj-susp | sbj-resum | ??? 86 || sbj cmp | 73 | loc 3 | cmpr| 2 
sbj-susp | ??? sbj-resum | 79 || sbj cmp | 57 | poss | 11 | loc | 5 
Total | 165 


64. These are actually all mismarked elliptic clauses: Num 29:17, 20, 23, 26, 29, and 32. 
65. These are all labels (titles): Psalm 5:1, 8:1, 9:1, 12:1, 22:1, 53:1, 55:1, 62:1, 67:1, 76:1, 77:1, and 80:1. 
The phenomena of suspension and resumption of constituents were introduced in §9.3.5.2 above. 
An example will recall the idea of suspension and resumption. Consider phrase marker (19.38) from 
1 Kgs 20:3. 


(19.38) cl sbj n:humn n:humn i 
oblq susp ^ unionidij L — — — ^ suf ~ ^" VÀ women 
BU q thee(m) 
^ and 
n:humn n:humn 


"31 sons 


\ ™ T thee(m) 

adj , 

def bw m ine 
D'àib good 

poss to * pron ^ to 


gram prep Mes NN 
^O me 


sbj 
resum 


app 


:Da they(m) 


The clause-initial CIC is the subject of the clause. Its susp licensing relation makes sense in light of 
the resum relation at the end of the clause. The referents of 02169 3333 PHI and 07 are identical. 
We say that the pronoun 07) resumes the suspended subject D210] PIN PHI. Note the attachment 
ambiguity in this clause. The NJPS translates this: “your beautiful wives and children are mine.” We 
parse it as ‘your wives and your good sons are mine’. Take your pick. 

A three-CIC VLC is said to be a tripartite verbal or nominal clause when “one of the three 
components is a third-person independent personal pronoun. One of the remaining two components 
may not be a noun phrase (NP) but an adverbial or a prepositional phrase.”⁶⁶ We leave the theory 
of tripartites to others.⁶⁷ 

There are also structures containing three CICs that we call Janus sentences. These are briefly 
treated in appendix 6. There, we analyze certain trios of CICs as sentences consisting of a pair of 
linked two-CIC verbless clauses, the middle CIC performing double duty in the first and second 
clause. 


66. T. Muraoka, “The Tripartite Nominal Clause Revisited,” in The Verbless Clause in Biblical Hebrew: Linguistic 
Approaches (ed. C. L. Miller; Linguistic Studies in Ancient West Semitic 1; Winona Lake, IN: Eisenbrauns, 1999) 188. 

67. Those interested in the details of tripartite verbless clause analysis will do well to consult W. van Peursen, 
“Three Approaches to the Tripartite Nominal Clause in Syriac,” in Corpus Linguistics and Textual History (ed. P. S. F. 
van Keulen and W. T. van Peursen; Assen: Van Gorcum, 2006) 157-73. The references in this paper provide entrée to 
the literature of this topic. See also in this volume the responses by Gideon Goldenberg (pp. 175-84), by Jan Joosten 
(pp. 185-88), and by Takamitsu Muraoka (pp. 189-96) along with van Peursen’s re-response (pp. 197-204). 
A somewhat related phenomenon is worth noting. There are three places where a subject is 
immediately followed by a distributive subject followed by some other distributive CIC: Exod 25:20, 
37:9; and Num 16:17. 


19.12 Multi-CIC Verbless Clauses 


Together, the one-, two-, and three-CIC VLCs account for 95% of the VLCs, leaving the four- 
CIC through ten-CIC VLCs to account for the final 5%. 


19.12.1 Four-CIC Verbless Clauses 
We count 349 four-CIC VLCs.⁶⁸ These subdivide into four major types: 


1. Subject and subject complement (sbj and sbj cmp) plus two other CICs (50%) 
2. Subject (sbj) plus three non-sbj cmp CICs (22%) 
3. Suspended and later resumed subject plus two CICs (15%) 
4. Four non-subject (non-sbj) CICs (8%) 


We do not go into detail. Phrase marker (19.39) from Deut 14:19 is an example of type #3: 


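(19.39) [Phrase marker, Deut 14:19. Glosses in text order: and / all of / swarm of / the / bird / 
unclean / he / to / you(m). The suspended subject (sbj susp) "all of swarm of the bird" is resumed 
by the pronoun "he" (sbj resum); "unclean" is the subject complement (sbj cmp), and "to you(m)" is 
a "to + pron" constituent.] 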


19.12.2 Five-CIC and Longer Verbless Clauses 

The 81 verbless clauses having 5 or more CICs account for 0.9% of the VLCs. We do not 
investigate them. Here, however, are the CIC sequences and references for the 12 longest. Note the 
similarities exhibited by 8 of the 10 clauses in Numbers 1. 


68. When we ignore syntactic isolates and operators, 267 four-CIC VLCs are unchanged while 7 five-CIC VLCs 
and 5 six-CIC VLCs reduce to four-CIC VLCs. 

# CICs | CIC Sequence | Reference 
9 | obj, poss, loc, obj, loc, loc, area, loc, obj [second-row labels: cmp, cmp] | Ezek 41:15 
9 | poss, refrnt, refrnt, instr, rsrc, sbj-susp, sbj-resum, poss, sbj cmp | Num 1:24 
9 | poss, refrnt, refrnt, instr, rsrc, sbj-susp, sbj-resum, poss, sbj cmp | Num 1:26 
9 | poss, refrnt, refrnt, instr, rsrc, sbj-susp, sbj-resum, poss, sbj cmp | Num 1:28 
9 | poss, refrnt, refrnt, instr, rsrc, sbj-susp, sbj-resum, poss, sbj cmp | Num 1:30 
9 | poss, refrnt, refrnt, instr, rsrc, sbj-susp, sbj-resum, poss, sbj cmp | Num 1:34 
9 | poss, refrnt, refrnt, instr, rsrc, sbj-susp, sbj-resum, poss, sbj cmp | Num 1:36 
9 | poss, refrnt, refrnt, instr, rsrc, sbj-susp, sbj-resum, poss, sbj cmp | Num 1:38 
9 | poss, refrnt, refrnt, instr, rsrc, sbj-susp, sbj-resum, poss, sbj cmp | Num 1:40 
10 | poss, refrnt, refrnt, refrnt, instr, rsrc, sbj-susp, sbj-resum, poss, sbj cmp | Num 1:32 
10 | sbj-susp, sbj-resum, refrnt, refrnt, instr, rsrc, refrnt, refrnt, poss, sbj cmp | Num 1:42 
10 | sbj, poss, loc, mvt aim, sbj cmp, mvt aim, loc, loc, mvt aim, poss | Ezek 48:21 

19.13 Insufficiently Investigated Problems in Verbless Structures 


To conclude this chapter, we provide a list of some problems associated with VLCs that await 
proper investigation. 


1. The binarism of the prevailing definitions of VLC as S and (nonverbal) P has neglected or 
overlooked VLCs with more than two CICs. We have seen, however, that Biblical Hebrew 
contains VLCs with as many as ten CICs. 

2. The proposal to treat VLCs as היה clauses with the equative verb ellipted is part of a 
larger and more general question of possible verb ellipsis in clauses that contain no verbal 
or quasiverbal predicator. Diagnostics are needed for telling the difference between a true 
VLC and a pseudo-VLC with verb ellipsis. 

3. A verbal or quasiverbal predicator is not the only kind of CIC that may be ellipted from a 
VLC. 

4. In the literature, the categories used to describe the syntax of VLCs are a mixture of 
grammatical, semantic, logical, pragmatic, and rhetorical functions and relations. For 
progress to be made, these dimensions need to be distinguished in a principled way. 

5. The differing functions of VLCs in discourse constitute a parameter, the variables of which 
are formal, modal, semantic, and functional. Formally, a VLC may be asyndetic, syndetic, 
subordinated, or nominalized. The possible correlation of the prototypical discourse 
functions of these types of VLCs with their distinct forms invites systematic investigation. 
Considerations of discourse coherence and cohesion may involve tendencies to ellipsis. 
This is evident in the abundant control involved with nominalized VLCs. While modality 
may be indicated by some operator, it can also be the result of discourse coherence. 
Polarity is another variable that has not been taken into account. 

6. The relative definiteness of the two constituents making up a standard two-CIC VLC assists 
in sorting out their functions. What differs among the researchers who pursue this notion 
is the fineness of the distinctions with regard to relative definiteness that they work with. 
When binary distinctions are involved, we find assertions such as: “In an identifying clause 
both the subject and predicate are definite. . . . In a classifying clause the subject is definite 
and the predicate indefinite.”70 Finer distinctions with regard to definiteness have been 
investigated but not fully worked out.71 


19.14 Brief Summary 


Naming Conventions and Agreed Characteristics. Surveying the evidence for terming these 
predicator-less structures nominal clauses versus verbless clauses, we retain our preference for 
verbless clauses (VLCs). There is substantial agreement in the literature regarding many 
characteristics of the verbless corpus, and we agree with almost all of this material. 

The Structure of VLCs. We see models of clauses such as the model advanced by Role and 
Reference Grammar as having limited usefulness where VLCs are concerned. Working in terms of 
information structure probably has much more to offer. 

Elliptic Verbal Clauses. These masquerade as verbless, usually having their predicators in 
adjacent clauses or at least nearby. 

Removing Marginal VLCs and CICs. In order to discern patterns the better, we drop from 
consideration verbless clauses that contain globally rare CICs. In the remaining corpus, we delete 
all syntactic isolates. For most studies, we also delete clause-initial coordinating conjunctions. 

Operators. Study of the positioning of operator CICs discloses that they strongly prefer to be 
clause initial (around 97%). 

One-CIC Verbless Clauses. After winnowing out rare CICs and isolated CICs, we obtain a 
one-CIC VLC corpus of 1,466 clauses. When we examine the distribution of one-CIC VLCs across 
four kinds of contexts, we find that spatial SRs occur in about 5% of the asyndetic, syndetic, and 
subordinated contexts but in 69% of the nominalized contexts. Furthermore, while subject 
complements are fairly common in subordinated clauses (34%), they are very rare in nominalized 
clauses (1%). Many one-CIC VLCs involve the phenomenon of control, whereby a “missing” subject is 
actually to be found in an embedding noun phrase, nearby in discourse, as the object of a matrix 
clause, or in world knowledge. Control situations might well be considered a subclass of ellipsis. 


69. But, it should be noted, very few investigators work, even in part, with definiteness. 

70. Van der Merwe et al., 248-49. 

71. For a reliable survey of Andersen's notions regarding definiteness and a trial of relative definiteness for Judges, 
see Kirk Lowery, "Relative Definiteness and the Verbless Clause," in The Verbless Clause (ed. C. L. Miller; Winona 
Lake, IN: Eisenbrauns, 1999) 251-72. 

Two-CIC Verbless Clauses. This is the largest group of VLCs, 5,320 in all, omitting Janus 
clauses. The few operators found in two-CIC VLCs are predominantly clause initial (94%). An 
overt subject is present in most two-CIC VLCs (95%). Subject complements are one-third of the 
CICs. Spatial SRs in nominalized clauses are two times more common than in syndetic clauses and 3¼ 
times more common than in asyndetic or subordinated clauses. 

Three-CIC Verbless Clauses. In the text, we have only briefly summarized this class of 1,534 
VLCs. They almost always contain a subject (95%). More than half of them consist of a subject, a 
subject complement, and one different CIC. This “different CIC” has a location SR more often than 
any other (29% of the three-CIC VLC corpus). 

Multi-CIC Verbless Clauses. Four- through 10-CIC VLCs make up the final 5% of the corpus. 
For details, consult the text. 
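
The proportions quoted in this summary can be cross-checked against the counts reported in the chapter. The following is only a rough consistency sketch. It assumes that the one-CIC count after winnowing (1,466), the two-CIC count without Janus clauses (5,320), the three-CIC count (1,534), the four-CIC count (349), and the count of clauses with five or more CICs (81) together approximate the corpus on which the percentages were computed. The variable and function names are illustrative and not part of our software.

```python
# Rough cross-check of the VLC proportions, using counts quoted in this chapter.
# Assumption: these five counts together approximate the whole VLC corpus.
vlc_counts = {"1": 1466, "2": 5320, "3": 1534, "4": 349, "5+": 81}

total = sum(vlc_counts.values())


def share(*keys):
    """Percentage of the assumed corpus covered by the given CIC-count classes."""
    return 100.0 * sum(vlc_counts[k] for k in keys) / total


one_to_three = share("1", "2", "3")  # compare with "95% of the VLCs"
four_plus = share("4", "5+")         # compare with "the final 5%"
five_plus = share("5+")              # compare with "0.9% of the VLCs"

print(f"total={total}  1-3: {one_to_three:.1f}%  4+: {four_plus:.1f}%  5+: {five_plus:.1f}%")
```

Under that assumption the three shares agree with the rounded figures given in the text.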


Chapter 20 


Non-Tree Phrase Markers 


Until now, with a few exceptions, our phrase markers have been labeled, directed trees. Being 
trees, the phrase markers have obeyed these three constraints, briefly introduced in §4.2: 


Non-tangling Condition—Edges may not cross. 
Single-Mother Condition—Any non-root node has only one mother. 
Single-Root Condition—There is only one root node. 


As we will see below: Multiple Roots ⇒ Multiple Mothers ⇒ Tangling. For example, a phrase 
marker having multiple roots must also have multiple mothers and exhibit tangling. 

In this chapter, we examine the circumstances where syntactic structure requires violation of 
tree constraints. When this occurs, the resulting phrase marker is a graph, not a tree. 
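
The three constraints can be stated operationally. The sketch below is a minimal illustration, not a description of our software: it assumes a phrase marker given as a list of (mother, daughter) edges plus the left-to-right order of its terminal segments, and it uses a simplifying proxy for tangling, namely that some node dominates a non-contiguous stretch of text. It does not attempt to decide whether a crossing-free layout exists in general. The function name, the node labels, and the toy example (loosely modeled on a discontinuous apposition) are all hypothetical.

```python
from collections import defaultdict


def check_tree_constraints(edges, leaf_order):
    """Report which tree constraints a phrase marker satisfies.

    edges      -- list of (mother, daughter) pairs; assumed to be acyclic
    leaf_order -- terminal nodes in their left-to-right text order
    """
    mothers = defaultdict(set)
    daughters = defaultdict(list)
    nodes = set()
    for mother, daughter in edges:
        mothers[daughter].add(mother)
        daughters[mother].append(daughter)
        nodes.update((mother, daughter))

    roots = sorted(n for n in nodes if not mothers.get(n))
    multi_mother = sorted(n for n in nodes if len(mothers.get(n, ())) > 1)

    position = {leaf: i for i, leaf in enumerate(leaf_order)}
    cache = {}

    def leaf_positions(node):
        # Text positions dominated by this node (memoized, so shared nodes are fine).
        if node not in cache:
            if node in position:
                cache[node] = {position[node]}
            else:
                cache[node] = set().union(
                    *(leaf_positions(d) for d in daughters.get(node, ())))
        return cache[node]

    def has_gap(node):
        span = leaf_positions(node)
        return bool(span) and max(span) - min(span) + 1 != len(span)

    discontinuous = sorted(n for n in nodes if has_gap(n))
    return {
        "single_root": len(roots) == 1,
        "single_mother": not multi_mother,
        "non_tangling": not discontinuous,  # proxy: no node has a gapped yield
        "discontinuous": discontinuous,
    }


# Toy example: an apposition node "app" whose two parts are separated by "goal".
toy_edges = [("cl", "neg"), ("cl", "vb"), ("cl", "app"), ("cl", "goal"),
             ("app", "servant"), ("app", "nom_cl")]
toy_order = ["neg", "vb", "servant", "goal", "nom_cl"]
report = check_tree_constraints(toy_edges, toy_order)
```

In the toy example the single-root and single-mother conditions hold, while the node `app` is reported as discontinuous, which is the situation of simple discontinuity taken up in the next section.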


20.1 Tangled Edges: Discontinuity 


20.1.1 An Instance of a Tangled Edge 

Tangling occurs when there is no way to lay out a phrase marker so that no edge crosses 
another. When this occurs, there is discontinuity, which is “the splitting of a constituent by the 
intrusion of another constituent.”2 For the present, we seek simple discontinuity without attendant 
violation of either the single-mother or single-root conditions. Consider phrase marker (20.1) from 
Deut 23:16 (see p. 295). Here, the subject is the segment עֶבֶד ‘servant’ in discontinuous apposition 
with an extended nominalized clause providing a central characteristic of the servant. There is no 
way for these two constituents to be joined without having a crossing edge. 


20.1.2 The Incidence of Discontinuity as a Function of Licensing Relation 

It is interesting to study how the incidence of discontinuous constructions varies from one 
licensing relation to another. Some licensing relations are of a sort that never occurs with 
discontinuity. For a few, discontinuity is common. For each licensing relation encountered in the 
daughters of CICs, the following table presents the percent of occurrences that involve discontinuity. 


1. Because of layout engine inadequacies, a phrase marker may exhibit edge crossing that is avoidable. An 
egregious example was shown in §6.4.2.2, phrase marker (6.18). That phrase marker involves no intrinsic edge crossing. 
2. From our glossary. 

(20.1) [Phrase marker, Deut 23:16. Glosses in text order: not / thou(m) wilt shut up / servant / 
unto / lords / his/him / which / he will deliver himself / unto / thee(m) / from / with / lords / 
his/him.] 

Licensing Relation        Percent Discontinuous   Count Discontinuous   Incidence Count 
Definite 
Prepositional 
Suffixation 
Construct 
Union 
Apposition 
Joined 
Subsetted 
Echoed 
Modified 
Nested 
Supersetted 
Inverted Modification 
Bonded 
Distributed Apposition    74.0%                   37                    50 
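
The "Percent Discontinuous" column is simply the discontinuous count divided by the incidence count. As a small worked illustration, the sketch below recomputes the percentage for the three relations whose counts are quoted in this section (37 of 50 for distributed apposition, 51 of 183 for supersetting, 591 of 2,072 for inverted modification). We assume here that all three pairs of counts refer to the same population of constructions; the dictionary and its name are illustrative only.

```python
# (count discontinuous, incidence count) pairs as quoted in this section.
quoted_counts = {
    "Distributed Apposition": (37, 50),
    "Supersetted": (51, 183),
    "Inverted Modification": (591, 2072),
}

# Percent discontinuous = 100 * discontinuous / incidence, rounded to one decimal.
percent_discontinuous = {
    relation: round(100.0 * disc / total, 1)
    for relation, (disc, total) in quoted_counts.items()
}

for relation, pct in percent_discontinuous.items():
    print(f"{relation:<24}{pct:>6.1f}%")
```

The results (74.0%, 27.9%, and 28.5%) agree with the figures of 74.0%, "almost 28%," and "approximately 29%" given in the text.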


Discontinuous Suffixation. The single instance of discontinuous suffixation and the 9 instances 
of discontinuous constructs immediately attract our interest. The discontinuous suffixation is in Ps 
89:3. Since it involves a violation of the single-mother constraint, we take it up in §20.2.3.4. 

Discontinuous Construct. The 9 discontinuous constructs are a fascinating lot. They may be 
found in: Exod 26:2, 26:8, 36:9; 2 Sam 1:9; Isa 10:5; Jer 50:5; Hos 6:9, 14:3; Job 27:3. The 
construction in phrase marker (20.2) from Jer 50:5 is especially curious: 


(20.2) [Phrase marker, Jer 50:5. Glosses in text order: Zion / they(m) will ask / path of.] 

Discontinuous Union / Disjunction, Apposition, Join. These are not infrequent, occurring in 
4% to more than 8% of instances. Phrase marker (20.3) from Nah 1:2 provides a simple example: 

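(20.3) [Phrase marker, Nah 1:2. Glosses in text order: avenger / Yahweh / and / owner of / rage. 
The subject "Yahweh" intervenes between the two conjuncts of the union/disjunction phrase that 
forms the subject complement.] 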

Discontinuous Echo. This category may seem strange until one considers the alternatives. 
Consider phrase marker (20.4) from Josh 19:32 (see p. 297). If we do not call the second לִבְנֵי 
נַפְתָּלִי ‘to sons of Naptali’ an echo, what do we do with it? To avoid the tangling (discontinuity), 
some syntacticians might designate the first instance a "suspended beneficiary" and the second a 
"resumed beneficiary." But the idea of a constituent that is left hanging as being reactivated or 
completed by a later constituent hardly applies when the constituents are identical. Since we do not 
avoid discontinuity, we are happy with our solution. It seems to us to be less of a representational 
stretch. 

Discontinuous Modification. We have identified a fair number (195x) of these. That a later 
constituent in a clause often discontinuously modifies an earlier constituent in a clause is not in any 
doubt. Consider, for example, phrase marker (20.5) from 1 Sam 17:23: 


(20.5) [Phrase marker, 1 Sam 17:23. Glosses in text order: Goliath / the / Philistine / name / 
his/him / from / Gat.] 




(20.4) [Phrase marker, Josh 19:32. Glosses in text order: to / sons of / Naptali / he went out / 
the / lot / the / sixth / to / sons of / Naptali / to / clans / them(m). The two "to sons of 
Naptali" phrases are joined by the echo relation under a single beneficiary (benf) CIC.] 


Would anyone be comfortable arguing that the prepositional phrase combines with anything in the 
clause other than the apposition phrase? 

But matters are not always so clear-cut. Constituent attachment ambiguity is often a real 
problem. Consider the parsing represented in phrase marker (20.6) from Gen 37:3: 


(20.6) [Phrase marker, Gen 37:3. Glosses in text order: son of / old age / he / to / his/him. The 
prepositional phrase "to him" is attached as a discontinuous modifier within the subject complement.] 


In this verbless clause, should the prepositional phrase be a beneficiary CIC in its own right? A 
case for this parse certainly can be made. Ideally, and eventually, both possibilities should be 
represented. As we will show in a very limited way in §20.3, our representational apparatus is designed 
to allow for multiple parses. But, for our initial pass through the corpus, we decided to represent 
only the “salient reading,” as judged by our human “oracle” (Francis I. Andersen). So, we select the 
preferred parsing and represent it only at this point. 

Subsetting, Supersetting, and Nesting. Identifying and representing these three kinds of 
semantics-based structures is unfinished work. There surely are many more instances in which these 
sorts of structures might be usefully marked. In spite of the provisional status of these construction 
types, it nonetheless is interesting that such a large proportion (almost 28%, 51 out of 183) of the 
superset constituents are discontinuous. More than 60% of the discontinuous constituents involve 
the statement of a subject, an intervening CIC, and then the expansion of the subject, usually with 
a pronoun in a coordination phrase. Phrase marker (20.7) from 2 Chr 32:26 is a typical example: 


(20.7) [Phrase marker, 2 Chr 32:26. Glosses in text order: and / he was humbled / Yehizqiyyahu / 
in / height of / heart / his/him / he / and / dwellers of / Jerusalem. The subject "Yehizqiyyahu" 
is expanded, after the intervening location CIC, by the coordination phrase "he and dwellers of 
Jerusalem" (superset).] 


Discontinuous Inverted Modification. Approximately 29% (591 / 2,072) of the instances of 
inverted modification involve discontinuity. Discontinuous inverted modification involving nonverbal 
constituents is rare. We find eight instances in CIC daughter constituents.3 Rather than display one 
of these, we prefer to show a wonderful non-CIC daughter instance in phrase marker (20.8) from 
Song 6:8 (see p. 299). Here we have discontinuous inverted modification in the first conjunct as 
well as a discontinuous coordination phrase. 

The vast majority of instances of discontinuous inverted modification in CIC daughters involve 
compound predication of a quasiverbal plus a verbal participle or a finite verb. A full discussion 
of compound predication will be provided in our volume on discourse analysis when we deal with 
tense, aspect, mood, and voice as properties of clauses. For now, we represent a quasiverbal plus a 
verbal participle (or finite verb) by combining them into a verbal-participial phrase (502x) or 
finite-verb phrase (80x), as in phrase marker (20.9) from Esth 3:8. The predicator is doubly compounded. 


3. See Gen 31:41, 42:32; Exod 9:6, 27:15; Josh 11:21; 1 Chr 12:15, 23:4, 27:8. 


(20.8) [Phrase marker, Song 6:8. Glosses in text order: sixty / they(f) / queens / and / eighty / 
concubines / and / virgins / does not exist / number.] 

(20.9) [Phrase marker, Esth 3:8. Glosses in text order: exists / his/him / people of / one / 
scattered / and / spread / between / the / peoples / in / all of / satrapies of / kingdom / thee(m).] 



Discontinuous Bonding. The bonding relation is rare (57x). It licenses constructions “X, but 
not Y,” or “not X, but Y.” Nearly half involve discontinuity, as in phrase marker (20.10) from Job 
32:13: 

(20.10) [Phrase marker, Job 32:13. Glosses in text order: God / he will scatter / his/him / not / 
man. The subject "God ... not man" is a bonded (bnd) phrase split by the verb and its object.] 


Discontinuous Distributed Apposition. This licensing relation involves multiple mothers. See 
§20.2. 


20.1.3 The Incidence of Discontinuity as a Function of CIC Type 

We next document how the incidence of discontinuous constructions varies across CIC types 
(see table, p. 301). Another table gives the incidence of construction types across semantic roles 
(p. 302). Of the frequently occurring semantic roles, these most commonly exhibit discontinuity: 


CIC Type                    Percent Discontinuous 
Purely Verbal Participle    20.3% 
Subject Complement          3.9% 
Subject 
Vocative 
Direct Object 

In the 504 discontinuous verbal participial phrases, the “interpolate” is the subject 470 times (93%). 

With regard to the elevated frequency of other discontinuous CIC types (subject complement, 
subject, vocative, and direct object), we conjecture—but do not prove—that the discontinuous CICs 
are more complex (or as the literature has it, “heavier”) than their continuous brothers. 


20.2 Multiple Mothers: Construct Participles, Distributed Apposition, 
and Ellipsis 


We utilize multiple-mother constructions when we represent these three phenomena: 


* the triple nature of construct participles (109x), 
* distributed apposition (388x), 
* ellipsis without resorting to empty categories (1,186x). 


Since they violate the single-mother condition, the resulting phrase markers are not trees but are 
graphs. We shall take up each phenomenon in its turn. 


20.2.1 The Triple Nature of Construct Participles 
A few participles (109x) have construct morphology and also have a predicative function. We 
call them noun-verb/noun participles, indicating that they are nominal “up front” and both verbal 

CIC Type | Segmental | Continuous Phrase | Discontinuous Phrase | Clause and “Above” 
Impermanent 
Lapsus Calami 24 


Nebulous 9 


Syntactic Isolate 


Vocative 600 971 18 
Exclamative 220 122 28 
Label 88 595 

Predicator 
Inf. Absolute 191 4^ 
Ptcp: N-V/N 109 
Ptcp: N-V 2,327 9 
Ptcp: V 1,858 125 504 
Unanch. Finite V 35,098 218 130 
Anch. Finite V 20,725 9 Is 
Inf. Construct 5,756 [2 
Inf. Utterance 965 
Quasiverbal 1,224 3l 


Grammatical Function 
Subject Address 


Subject 15,705 
Obj. of Address 
Direct Object 10,174 
Second Object 211 288 3° 

Indirect Object 15 
Subject Cmpl 2,677 
Object Cmpl 386 838 13 

Cognitive Cmpl 
Cmpl Aspect T 435 

Operator 

Negative su [ | e —] 
Closed Interg. 86 |S 5 | | 
Modal 236 
Includer Scu 
Restricter T | | 1 — 


a. See Jer 4:31 and Amos 8:14. 

b. Each is a coordinated phrase with two infinitive absolute conjuncts: Deut 9:21, Jer 9:23, Ezek 
1:14, 2 Chr 31:10. 

c. See the compound construction in Ezek 25:7. 

d. In Exod 32:6, we find a coordination phrase with infinitive construct and infinitive absolute 
conjuncts. 

e. Deut 8:16, Isa 48:6, 2 Chr 24:14. 

f. These are “bare” infinitives construct: Exod 2:18; Isa 47:12; Jer 6:11, 15:18, 20:9; Amos 5:2; Ps 
77:3. 

Semantic Role CIC | Segmental | Continuous Phrase | Discontinuous Phrase | Clause and “Above” 
Other Participants 
Accompanier 3 1,657 8 
Agential 3 225 1? 
Alternate / Surrogate 180 
Beneficiary 2 2,083 12 
Exocentric Absolute 17 43 ——ÀÓ3— 
Harmed One 260 1° 
Involved Ones 80 
Aspectualizer 
Infinitive Intensifier 483 19 IE 
Movement 
Aim (Goal) 369 4,152 17 
Direction (Bearing) 11 126 2° 
Interval 80 
Origin 24 4,272 12 
Space 
Area 33 


100 87 


Separation 23 

Time 

Point 535 3,144 21 

Aim (Goal) 4 863 
Interval 367 828 4 
Origin 252 18 
Enriching Constituents 

Cost 

Instrument 18 


Manner 


Material 


Number Count 


Number of Times 
Reference E 1 Jue 


Resoun 241 =—— 
i 

Quantity 49 

Phrasal “Discourse Units” 

Cause 63 

Comparison 2,662 
: 

Quoter 925 

Semantic Role CIC | Segmental | Continuous Phrase | Discontinuous Phrase | Clause and “Above” 
Mixed-Level Constituents 
Aim /Purpose 
“But-Rather” 1” 
Concessive 
Reason 
Undesired Outcome 
a. Gen 28:14. 
b. Zeph 3:19, Job 34:10. 
c. 2 Chr 32:25. 
d. Ps 126:6. 
e. Josh 18:13, Isa 2:8. 
f. Judg 11:33, Ezek 46:23. 
g. Prov 8:23. 
h. Isa 10:7. 
i. Josh 5:2. 
j. Exod 23:14; Deut 16:16. 
k. Deut 32:42; Lev 1:10; Num 31:28, 30, 42. 
l. Isa 35:4; Amos 5:16. 
m. 1 Kgs 12:20. 
n. Deut 4:12. 
o. Ezek 32:30. 


and nominal “out back.” Rather than ignore the construct morphology, we choose to represent both 
the construct behavior and the verbal behavior as is shown in phrase marker (20.11) from Gen 4:15: 
(20.11) [Phrase marker, Gen 4:15. Glosses in text order: slaying of / Qain. Nodes, left to right: 
nom ptc / cstr; nom ptc / oblq; n-v ptc / gram and dir obj / gram.] 


The left-most node encodes the fact that a construct relation is present. A sequence of three edges 
runs along the top of the diagram from this node to the nomen regens segment; a single edge runs 
from the left-most node to the nomen rectum segment. Hence, the nomen rectum segment has two 
mother nodes, making the structure a graph rather than a tree. 

The next node to the right indicates that predicative behavior is present. A predicator (n-v 
ptc / gram) and a direct object (dir obj / gram) are both daughters of the nom ptc / oblq node. 
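
The description of phrase marker (20.11) can be restated as an edge list, which makes the double motherhood of the nomen rectum easy to verify. The sketch below is only an illustration built from the prose above; the node names are informal labels chosen for the example, not identifiers from our database.

```python
from collections import defaultdict

# Edge list transcribed from the prose description of (20.11); labels are informal.
edges = [
    ("nom ptc/cstr", "nom ptc/oblq"),  # first of the three edges along the top
    ("nom ptc/oblq", "n-v ptc/gram"),  # predicator
    ("n-v ptc/gram", "slaying of"),    # nomen regens segment
    ("nom ptc/cstr", "Qain"),          # single edge to the nomen rectum segment
    ("nom ptc/oblq", "dir obj/gram"),  # direct object
    ("dir obj/gram", "Qain"),          # the same segment, reached as direct object
]

mothers = defaultdict(list)
for mother, daughter in edges:
    mothers[daughter].append(mother)

# Nodes that violate the single-mother condition.
multi_mother = {node: ms for node, ms in mothers.items() if len(ms) > 1}
print(multi_mother)
```

Only the segment glossed "Qain" has more than one mother: the construct node and the direct-object node.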




20.2.2 Distributed Apposition 

To quote our glossary, “distributed apposition licenses the assembly of apposition constructions 
wherein a later constituent is in apposition to two or more earlier constituents considered together. 
The resulting construction exhibits multidominance and discontinuity.” Previously, we mentioned 
“distributed apposition” in passing (§5.4 and §6.4) and provided two incidental examples while 
discussing other phenomena: in §7.2.2 (1 Sam 31:2, two segments in distributed apposition with 
a final phrase) and in §8.1.3.4 (1 Chr 22:13, two basic phrases in distributed apposition with a 
lengthy nominalized clause). These phrase markers should be examined afresh. 

Distributed apposition occurs in these kinds of constructions: 

* Rare are segments and phrases in distributed apposition with a final segment: 

   1. Two segments: Exod 37:25, Ezek 40:47 
   2. Nine segments: Ezra 8:16 
   3. Two phrases: Exod 38:1, Lev 9:3, Ezek 42:5 
   4. Three phrases: Num 28:14 

* Less rare are segments in distributed apposition with a final phrase: 

   1. Two segments: 41 instances 
   2. Three segments: Exod 32:13, Num 32:34, Josh 15:14, 1 Sam 31:2, Zech 6:10, 
      1 Chr 10:2 
   3. Five segments: Num 31:8, Josh 13:21 
   4. Ten segments: Esth 9:7-10 
   5. Fifteen segments: 2 Chr 11:6-10 

* More common are phrases in distributed apposition with a final phrase: 

   1. Two phrases: 94 instances 
   2. Three phrases: Lev 9:10, 20:25; Deut 7:11, 14:6, 16:11; Jer 8:2, 24:8; Neh 1:7, 
      2 Chr 23:9 
   3. Four phrases: Lev 23:38; Deut 16:14, 28:27; Jer 27:19-20, 29:1; 1 Chr 22:16 
   4. Five phrases: Jer 27:9 
   5. Six phrases: Ezek 36:4 

* Mixed constituents occur 15 times: 

   1. with a final segment: 
      a. Phrase and segment: Lev 9:2 
   2. with a final phrase: 
      a. Segment and phrase: Deut 4:47; 2 Sam 4:9; Jer 49:28; Zech 1:12; Ps 26:9, 
         95:8-9, 147:7-9; Song 8:14 
      b. Phrase and segment: Jer 10:9, Ezek 36:5, Nah 3:17, Qoh 12:1 
      c. Phrase, segment, phrase: 2 Chr 20:10 
      d. Segment, segment, segment, phrase, segment, segment, segment, segment: 
         Num 32:34 


20.2.3 Ellipsis 
20.2.3.1 The Traditional Definition of Ellipsis 
Here is a traditional definition of ellipsis, from Trask: 


Any construction in which some material which is required for semantic interpretation and which 
could have been overtly present is absent but immediately recoverable from the linguistic context, 
particularly when that material is overtly present elsewhere in the sentence.4 


4. R. L. Trask, A Dictionary of Grammatical Terms (London: Routledge, 1993) 89. 

Most of the time, this definition suffices. But, as we shall see, there are contexts in which it is too 
simplified, since “[e]llipsis is a discourse phenomenon, in the sense that the interpretation of the 
missing constituent sometimes depends on something said in an earlier sentence—possibly even by 
another speaker.”5 Indeed, the interpretation of the missing constituent may lie even further afield. 


20.2.3.2 Standard Biblical Hebrew Reference Works on Ellipsis 
The standard reference works have little to say regarding ellipsis: 


* GKC6 has a few brief references to the topic: 

   1. §117f gives eight instances of omitted pronominal objects. 
   2. §116s takes up omitted personal pronouns in participial clauses. 
   3. §134n asserts that “[c]ertain specifications of measure, weight, or time, are commonly 
      omitted after numerals.” 

* Waltke and O'Connor touch lightly on ellipsis in a three-page discussion of some uses of 
prepositions.7 

* In van der Merwe et al., ellipsis appears only in a footnote: “The complement of a verb 
may be omitted, but then only when it can be inferred from the context of the sentence.”8 

* Joüon/Muraoka have scattered passing references to the ellipsis of: objects, subjects, time 
words, measure words, participles, protases, and apodoses.9 

* Williams has a three-and-one-half page concluding section on ellipsis.10 It is only slightly 
generalized from what Joüon/Muraoka have. 


All in all, these reference works “address” ellipsis in a perfunctory, echo-chamber sort of way. 


20.2.3.3 Our Approach to Representing Ellipsis 

In deciding how to represent ellipsis, we began with the traditional definition. If it was true that 
the missing material was overtly present nearby, then why not represent this fact by joining the 
“external material” to the clause exhibiting ellipsis, running an edge from the shortened clause to the 
external material? Resolving ellipsis explicitly required having nodes with multiple mothers, but 
we already had introduced them for dealing with construct participles and distributed apposition. 

Forward Ellipsis. Phrase marker (20.12) from Isa 34:13 illustrates our representational 
strategy for the usual (forward) ellipsis (also known as “analipsis”). We draw an edge from the second 
clause's root node up to the predicator node in the first clause, indicating that the predicator does 
service in both clauses. The more common way of representing ellipsis is to have an empty node11 
in the second clause and to co-index it with the predicator in the first clause. 
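
The contrast between the two representations can be made concrete with a small data-structure sketch. What follows is an assumption-laden illustration rather than our implementation: the `Node` class, the helper functions, the `vb_1` co-index label, and the English glosses standing in for the Isa 34:13 constituents are all hypothetical. In the multiple-mother version, one and the same predicator object is a daughter of both clause nodes; in the empty-node version, the second clause receives a co-indexed placeholder instead.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    label: str
    gloss: str = ""
    daughters: list = field(default_factory=list)


# Multiple-mother representation: one predicator object shared by both clauses.
verb = Node("vb", "she will ascend")
clause1 = Node("cl", daughters=[verb, Node("loc", "her fortifications"),
                                Node("sbj", "thorns")])
clause2 = Node("cl", daughters=[verb, Node("sbj", "nettles and thorn"),
                                Node("loc", "in her fortresses")])
sentence = Node("sent", daughters=[clause1, clause2])


def mothers_of(target, root):
    """Collect every node under root that has target as a daughter (identity test)."""
    found, stack, seen = [], [root], set()
    while stack:
        node = stack.pop()
        if id(node) in seen:
            continue
        seen.add(id(node))
        if any(d is target for d in node.daughters):
            found.append(node)
        stack.extend(node.daughters)
    return found


def with_empty_node(clause, shared, index=1):
    """Copy a clause, replacing the shared node by a co-indexed empty node."""
    return Node(clause.label, daughters=[
        Node(f"{d.label}_{index}", "(empty)") if d is shared else d
        for d in clause.daughters])


shared_count = len(mothers_of(verb, sentence))  # the predicator has two mothers
clause2_gap = with_empty_node(clause2, verb)    # empty-node alternative
tree_version = Node("sent", daughters=[clause1, clause2_gap])
tree_count = len(mothers_of(verb, tree_version))
```

The shared-node version is a graph (the predicator has two mothers), whereas the co-indexed version remains a tree at the cost of introducing an empty category.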


5. I. A. Sag, T. Wasow, and E. M. Bender, Syntactic Theory: A Formal Introduction (2nd ed.; Stanford, CA: CSLI, 
2003) 416. 
6. B. T. Arnold and J. H. Choi (A Guide to Biblical Hebrew Syntax [Cambridge: Cambridge University Press, 2003] 
192) repeat the GKC observations, adding that “[e]llipsis also occurs frequently with negatives.” Where Arnold and 
Choi see ellipsis of negatives, we see scoping of negatives (see §9.3.5.1). 
7. Waltke and O'Connor, IBHS, 222-25. 
8. Van der Merwe et al., 241. 
9. Joüon and Muraoka, passim (14 articles). 
10. Williams, 208-11. 
11. The empty node occupies the place where the ellipted constituent “should be.” This sort of locus is known as a 
gap, “a location in a sentence in which no element is overtly present even though some element appears to be in some 

(20.12) [Phrase marker, Isa 34:13. Glosses in text order: and / she will ascend / fortifications / 
her / thorns / nettles / and / thorn / in / fortresses / her. An edge runs from the second clause 
to the verb (vb) of the first clause.] 

The verb in the first clause does “double duty,” being the explicit predicator in the first clause and 
the ellipted predicator down in the second clause. We find 1,135 instances of forward verb ellipsis. 

Backward Ellipsis. A verb may be missing in a first clause, being supplied in a second clause. 
This situation involves backward ellipsis, also known as “catalipsis.” Phrase marker (20.13) from 
Ps 70:2 illustrates the phenomenon. We have found 51 instances of backward verb ellipsis. 

(20.13) [Phrase marker, Ps 70:2. Glosses in text order: God / to / deliver / me / Yahweh / to / 
help / me / thou(m) hasten. The verb "thou(m) hasten" of the second clause also serves as the 
predicator of the first clause.] 



sense grammatically required" (Trask, A Dictionary, 114). 

In an ongoing series of papers, Cynthia Miller has been investigating and characterizing the 
various kinds of ellipsis. In this subsection, we comment on her generalizations regarding verb 
ellipsis.12 We are not yet in a position to assess Miller's work for two reasons: (1) we have not yet 
carried out consistency checking on our own ellipsis markup;13 (2) as Miller demonstrates,14 the 
parsing of clauses that may involve ellipsis often involves resolution of ambiguity. 

To understand the second point, consider phrase marker (20.14) from Nah 3:10, where we have 
parsed the two asyndetic clauses as involving ellipsis. The first clause, however, might be a 
verbless clause. The NJPS renders the clause as though it were verbless: “Yet even she was exiled,” as 
does the NRSV: “Yet she became an exile.” Nonetheless, we are comfortable with our parsing for 
reasons that might be adduced. Both interpretations are defensible. As we will glimpse in §20.3, we 
are able to represent both parses. But, for now, we show only our preferred parse: 

(20.14) [Phrase marker, Nah 3:10. Glosses in text order: she / to / the / exile / she walked / in / 
the / captivity. The first clause (subject "she," movement aim "to the exile") shares the verb 
"she walked" of the second clause.] 

Miller's Three Universal Conditions15 (“must be present for ellipsis to operate”): 

1. “[E]llipsis operates on coordinate structures [be they syndetic or asyndetic].” 
2. “[T]he two halves of the coordinate sentence must . . . match exactly.”16 
3. “[T]he verb that is present and the verb that is deleted must be lexically identical. . . . 
However, the two verbs need not be identical in person, gender, or number.” 


12. C. L. Miller, “A Linguistic Approach to Ellipsis in Biblical Poetry (Or, What to Do When Exegesis of What
Is There Depends on What Isn't),” BBR 13 (2003) 251-70. On another kind of ellipsis, see idem, “Ellipsis Involving
Negation in Biblical Poetry,” in Seeking Out the Wisdom of the Ancients: Essays Offered to Honor Michael V. Fox on the
Occasion of His Sixty-Fifth Birthday (ed. R. L. Troxel, K. G. Friebel, and D. R. Magary; Winona Lake, IN: Eisenbrauns,
2005) 37-52.

13. Consistency checking is an area that we are only beginning to enter. See A. D. Forbes, “The Challenge of
Consistency,” Computer Assisted Research on the Bible in the 21st Century: Proceedings of the Eighth AIBI Conference, El
Escorial, Madrid, June 2008 (ed. L. Vegas, G. Seijas, and J. del Barco; Piscataway, NJ: Gorgias, 2010) 99-115.

14. Miller, “A Linguistic Approach,” 255-60.

15. Ibid., 260-62.

16. In her n. 30, Miller specifies two situations in which this condition may be violated.
These conditions have great merit, but they are more “rules-of-thumb” than laws. Our parse of Nah
3:10, for example, disobeys condition #2. Consequently, we are more inclined to say that the
conditions are very useful generalizations than to use them to exclude certain parses.
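
One way to picture conditions used as rules-of-thumb rather than as filters is a check that reports which conditions a candidate parse fails without rejecting it. The sketch below is only an illustration: the ClauseHalf record, its fields, and the constituent labels are assumptions made for the example and are not the representation used in our database.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClauseHalf:
    """One half of a candidate ellipsis pair (hypothetical representation)."""
    verb_lemma: str        # lemma of the overt or restored verb
    constituents: tuple    # labels of the non-verbal constituents, in order


def violated_conditions(first, second, coordinated=True):
    """Return the numbers of the universal conditions that the pair fails."""
    failed = []
    if not coordinated:                              # condition 1
        failed.append(1)
    if first.constituents != second.constituents:    # condition 2
        failed.append(2)
    if first.verb_lemma != second.verb_lemma:        # condition 3 (lemma only)
        failed.append(3)
    return failed


# Invented labels, loosely modeled on the Nah 3:10 parse: the halves differ.
first = ClauseHalf("walk", ("sbj", "aim"))
second = ClauseHalf("walk", ("loc",))
print(violated_conditions(first, second))   # [2]
```

A parse that returns a non-empty list is flagged for review rather than discarded, which matches treating the conditions as generalizations.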

Distinctively Hebrew Features of Ellipsis:

1. “[E]llipsis may occur in the first line, a situation known as backwards ellipsis.”
2. “Ellipsis may occur when the verb is in initial, medial, or final position with respect to the
other clausal constituents.”
3. “[B]ackwards ellipsis only occurs when the verb is in final position.”
4. “The order of constituents may be chiastic, unless the verb is in final position.”¹⁷


As statements of realized possibilities, the first, second, and fourth features are demonstrably true. 
Examples illustrating each statement lie ready to hand. It is the third assertion that gives us pause. 
We must question it on two grounds: cross-linguistic and empirical. 

Cross-Linguistic. Miller tells us that “this observation about backwards ellipsis corresponds
to what we know about ellipsis cross-linguistically.”¹⁸ We find contrary evidence. Using Sanders's
six-type classification of ellipsis patterns, Haspelmath asserts that “no ellipsis type is universally
impossible, but there are strong restrictions on which combinations of ellipsis types a language can
have.”¹⁹ He even cites one language that allows all six types of ellipsis: Tojolabal.

Empirical Evidence. More convincing is the empirical evidence. Consider phrase marker 
(20.15) from Ps 77:2 in which the verb is second-clause initial, contrary to Miller's feature #3: 


(20.15) [Phrase marker for Ps 77:2: two coordinated clauses. Glosses: "voice", "me", "unto",
"God"; "and", "I cry", "voice", "me", "unto", "God".]


17. Ibid., 262-65.

18. Ibid., 264.

19. Martin Haspelmath, “Coordination,” in Language Typology and Syntactic Description, vol. 2: Complex
Constructions (2nd ed.; ed. Timothy Shopen; Cambridge: Cambridge University Press, 2007) 44-45. Haspelmath is
describing ellipsis in general.
The NJPS translates: “I cry aloud to God; I cry to God.” Parsing the first clause as verbless seems
forced to us, given that the two clauses are identical except for the verb. Other examples of
backward ellipsis where the predicator is not second-clause final do exist. See, for example, Gen 14:23;
2 Kgs 7:18, 25:15 (= Jer 52:19b); Isa 3:8-9; Jer 6:20b.

Multiple Ellipsis. It is fairly common for multiple items to be ellipted. An example is shown 
in phrase marker (20.16) from Gen 28:20, where both the verb and the indirect object are ellipted. 
(20.16) [Phrase marker for Gen 28:20. Glosses: "he will give", "to", "bread", "to", "eat", "and",
"garment", "to", "dress".]

Some of our clause pairs marked as containing ellipsis involve not a little exegesis and may be 
analyzed along more conventional lines upon further consideration. Phrase marker (20.17) from Ps 
89:3 shows what we are talking about. 
(20.17) [Phrase marker for Ps 89:3. Glosses: "I said", "eternity", "he will be built", "heaven",
"thou(m) wilt appoint", "reliable", "thee(m)", "in", "them(m)".]

This is the only instance in the MT where a pronoun suffix has two mothers. . . . A bridge too far? 
20.3 Multiple Roots: Ambiguity and Multiple Parses


Elsewhere in this book, we have discussed lexical ambiguity (§1.1.3.3 and appendix §A1.3.3),
structural ambiguity (§1.1.3.4), and clause-boundary ambiguity (§2.2.2). We have also published
two papers addressing the problems of ambiguity, especially structural ambiguity.²⁰

In this subsection, we will use a simple example to illustrate how our representation can depict 
structural ambiguity. Consider the parsing(s) of Amos 1:2a: 


יהוה מציון ישאג

Taken in isolation, this clause is ambiguous.²¹ The ambiguity is explicit in that the clause can be
rendered by two translations, involving different CICs:

Yahweh, from Zion, has roared (sbj loc pred).
Yahweh-from-Zion has roared (sbj pred).

Suppose that we wish to visualize both parses. We might show two separate parses, or we might 
combine the possibilities as is shown in phrase marker (20.18). 


(20.18) [Phrase marker for Amos 1:2a with two root clause nodes. Glosses: "Yahweh", "from",
"Zion", "he roared".]


Here cl₁ and its offspring represent the first parse (involving three CICs). Cl₂ and its offspring
embody the second (involving two CICs). The phrase marker has two roots. Two of its nodes have two
mothers, and it exhibits tangling. Clearly, this phrase marker is not a tree.
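
The difference between such a phrase marker and a tree can be made concrete with a small graph sketch. The edge list below is a simplified, assumed encoding of the two parses of Amos 1:2a. Its node names are illustrative, and its node inventory does not reproduce (20.18), so the number of shared nodes it yields need not match the published figure. Tangling (crossing edges) is not modeled.

```python
from collections import defaultdict

# Mother-to-daughter edges for a simplified two-parse marker of Amos 1:2a.
# Node names are illustrative assumptions, not the labels used in (20.18).
EDGES = [
    ("cl1", "yahweh"), ("cl1", "from_zion"), ("cl1", "roared"),   # sbj loc pred
    ("cl2", "yahweh_from_zion"), ("cl2", "roared"),               # sbj pred
    ("yahweh_from_zion", "yahweh"), ("yahweh_from_zion", "from_zion"),
]


def roots_and_shared(edges):
    """Return (nodes with no mother, nodes with more than one mother)."""
    mothers = defaultdict(set)
    nodes = set()
    for mother, daughter in edges:
        mothers[daughter].add(mother)
        nodes.update((mother, daughter))
    roots = sorted(n for n in nodes if len(mothers[n]) == 0)
    shared = sorted(n for n in nodes if len(mothers[n]) > 1)
    return roots, shared


def is_tree(edges):
    """True if an acyclic edge list has one root and no node with two mothers."""
    roots, shared = roots_and_shared(edges)
    return len(roots) == 1 and not shared


roots, shared = roots_and_shared(EDGES)
print(roots)            # ['cl1', 'cl2']
print(is_tree(EDGES))   # False
```

Keeping only the edges under cl1 gives a single root and no shared nodes, so that fragment on its own passes the tree test.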


20.4 Brief Summary 


Discontinuity. We investigate parses in which phrase marker edges unavoidably must cross.
Regarding licensing relations, some licensing relations essentially disallow discontinuity: definite,
prepositional, suffixation, and construct. With the other licensing relations, there is discontinuity.
Its frequency can range from rather rare (union, 4%) to predominant (distributed apposition, 74%).
The largest counts are for inverted modification (591x) and apposition (529x). Regarding CIC
subtype, verb participles in periphrastics exhibit the most discontinuity (504x, 20%). Regarding


20. F. I. Andersen and A. D. Forbes, “Syntactic Ambiguity in the Hebrew Bible,” in Proceedings of the Fourth
International Colloquium: Bible and the Computer—Desk and Discipline (Paris: Honoré Champion, 1995) 356-67; idem,
“Attachment Preferences in the Primary History,” in Proceedings of the Sixth International Colloquium: From Alpha to
Byte (Leiden: Brill, 2002) 167-86.

21. The next clause is not ambiguous and drives the parse toward the first interpretation: ומירושלם יתן קולו. This is
the parse opted for in our disambiguated current rendering of the text.
GFs, subjects have the largest number of discontinuities, albeit resulting in a low overall
percentage (396x, 1.2%). Regarding SRs, discontinuity is quite rare, perhaps because these constituents
typically follow the core constituents. The location SR is discontinuous more often than any other
SR (69x, 0.5%).

Multiple Mothers. Constituents have multiple mothers in our representations of construct
participles (109x), distributed apposition (388x), and ellipsis (1,186x). After examining treatments of
ellipsis in the reference works, we provide examples of forward and backward ellipsis. We then
examine Cynthia Miller’s approach to verbal ellipsis, finding her “universal conditions” and
“distinctive features” illuminating but treating them as rules-of-thumb rather than strict constraints. (We
do take issue with one of her features, finding exceptions to her assertion that “[b]ackwards ellipsis
only occurs when the verb is in final position.”)

Multiple Roots. We show how we have made provisions for handling ambiguity and multiple 
parses. To date, we have nowhere relied upon our ability to have phrase markers with multiple 
roots. 


Chapter 21 


Discourse Analysis and 
Supra-Clausal Structures 


Our primary goals in this chapter are first to provide an outline of our planned approach to 
discourse analysis (§21.1) and then to illustrate how our barely begun work on discourse analysis 
has affected the structure of the current phrase markers (§21.2). We will also note two additional 
respects in which our phrase markers represent “work in progress.” We will then introduce the 
supra-clausal constituents (SCCs) and cue phrases that appear in our supra-clausal representation 
(§21.3). Finally, we will expose the problem of “paradoxical intra-clausal discourse units” (§21.4). 


21.1 Introduction to Our Perspectives on Discourse Analysis 


We touched on discourse analysis very briefly in §1.2.3.3, on the supra-clausal “level” in §5.1,
and on one kind of discourse construction, the adverbial subordinated clause, in §8.3. We now turn
to a more detailed introduction to our approaches to discourse analysis. In this subsection, we will
sketch our approach to discourse analysis (DA), an approach that we hope to develop in detail in a
later volume.


21.1.1 Definitions of Discourse Analysis 

Introducing The Handbook of Discourse Analysis, Schiffrin et al. observe that “[s]o abundant
are definitions of discourse that many linguistics books on the subject now open with a survey of
definitions.”¹ They comment that the many definitions of discourse fall into one of three
super-classes: “(1) anything beyond the sentence, (2) language use, and (3) a broader range of social
practice that includes nonlinguistic and nonspecific instances of language.” We focus on
linguistics-based approaches to super-class (1).

The terminology of DA is chaotic. Different analysts use terms in different ways.
Kruijff-Korbayová and Steedman write of the need to “transcend the difficulties caused by proliferating
terminologies.”² Here, we will attempt to define our own terms clearly and use them consistently,
indicating how others use these terms differently or use different terms roughly equivalent to ours.
Our emphases in analyzing texts are on communication (not acquisition) and on recognition (not
generation). These emphases affect our assessment of the approaches taken by discourse analysts.


1. Deborah Schiffrin et al., The Handbook of Discourse Analysis (Oxford: Blackwell, 2001) 1.
2. Ivana Kruijff-Korbayová and Mark Steedman, Information Structure, Discourse Structure, and Discourse
Semantics: Workshop Proceedings (13th European Summer School in Logic, Language and Information, 2001) v.
21.1.2 Eight Aspects of Discourse Analysis
We will now explain our approach to discourse analysis in terms of these eight aspects:


1. interface with syntax
2. text types analyzed
3. primary foci
4. basic units of analysis
5. dimensions of discourse
6. representation(s)
7. analytic procedures
8. rule and knowledge bases


21.1.2.1 Interface with Syntax 

Webber et al. describe three stances regarding the relation of sentence syntax and DA:
narrow-focus, disjoint, and unified.³

Those who take the narrow-focus stance try to represent discourse phenomena within
sentence-level grammar. We know of no successful narrow-focus analyses. Those who take the disjoint
approach seek to “use a completely different approach to discourse-level syntax and semantics than to
sentence-level syntax and semantics, combining (for example) a definite clause grammar with
rhetorical structure theory.”⁴ Those who adopt the unified stance “recognize the overlapping scope and
similar mechanisms [at sentence and discourse levels] and simply extend a sentence-level grammar
and its associated semantic mechanisms to discourse.”⁵ Both disjoint and unified approaches
continue to be researched. Our hunch is that the way forward will combine both disjoint and unified
approaches, relying more on the latter.


21.1.2.2 Text Types Analyzed 
As can be seen from appendix §A7.3, even setting aside the poetry/prose distinction, text type 
assignment can become quite elaborate. We distinguish these four rough-and-ready text types: 


1. exposition (“an explanation") 

2. narration (“a story") 

3. indirect speech (“he said that . . .") 
4. dialogue 


21.1.2.3 Primary Foci 
Linguistics-oriented discourse analysis focuses on one or more of the following: 


1. information structure
2. discourse cohesion
3. discourse structure
4. discourse semantics⁶


3. Bonnie Webber et al., “Anaphora and Discourse Structure,” Computational Linguistics 29 (2003) 572.

4. Ibid.

5. Ibid.

6. For good or ill, these terms are often not truly mutually exclusive.
Information Structure. “Information Structure is a [Clause] Internal partition of the
information in an utterance according to its relation to the discourse context under dichotomies such as
topic/comment, theme/rheme, given/new, focus/background etc. Such categories are essentially
Referential in nature.”⁷

Discourse Cohesion. Halliday and Hasan “designed [discourse cohesion] to move beyond
the structural resources of grammar and consider discourse relations which transcend grammatical
structure. . . . [Cohesion is achieved via] reference, ellipsis, substitution, conjunction, and lexical
cohesion.”⁸

Discourse Structure. “Discourse Structure concerns the Inter-clausal relations of
explanation, elaboration, exemplification, and illocutionary force that hold between successive utterances
of a discourse or dialog, supporting inference about the domain and purposes of the discourse.”⁹
“[Discourse Structure] thus subsumes notions such as [discourse] segmentation, relations between
[discourse] segments (informational and intentional), anaphoric relations, modal subordination,
discourse topic, thematic progression, etc.”¹⁰

Discourse Semantics. “Discourse Semantics centrally concerns the nature of the contextual
model, and the entities in it to which Information Structure categories relate. . . .”¹¹


21.1.2.4 Basic Units of Analysis 

We refer here to the problem of discourse segmentation. Or, as Polanyi asks: “what are the
atomic units of discourse?”¹² For most researchers, a starting point is the main clause. One variable
is whether the basic units include or exclude cue phrases.¹³ Discourse units can be:


1. clauses with any preposed cue phrase included as part of the basic unit
2. clauses with any preposed cue phrase external to the basic unit
3. certain phrases
4. certain fragments “[whose] full interpretation remains unrecoverable from surrounding
context.”¹⁴


Discourse units are assigned various names. The most common terms are discourse unit (du),
elementary discourse unit (edu), and elementary discourse constituent unit (e-dcu).


7. Kruijff-Korbayová and Steedman, Information Structure, 1; emphasis in original.

8. James R. Martin, “Cohesion and Texture,” in The Handbook of Discourse Analysis (ed. D. Schiffrin et al.; Oxford:
Blackwell, 2001) 36. According to Martin, later work recast “non-structural” cohesive phenomena as discourse semantic
structures.

9. Kruijff-Korbayová and Steedman, Information Structure, 1.

10. Ibid., v.

11. Ibid., 1.

12. Livia Polanyi, “The Linguistic Structure of Discourse,” in The Handbook of Discourse Analysis (ed. D. Schiffrin
et al.; Oxford: Blackwell, 2001) 265.

13. Cue phrases are linking words such as if, because, since. They are also called discourse markers (quite
common), discourse items (R. L. Trask, Dictionary of Grammatical Terms [London: Routledge, 1993] 84), and
extrapropositional discourse operators (Polanyi, “Linguistic Structure of Discourse,” 265). See §21.3.2.

14. Livia Polanyi et al., “A Rule Based Approach to Discourse Parsing,” Proceedings of the 5th SIGdial Workshop
on Discourse and Dialogue (2004) 111.
21.1.2.5 Dimensions of Discourse
The relations among the discourse segments can be described in terms of multiple concurrently
valid dimensions. What we call dimensions, most analysts call levels. Levels imply mutual
exclusivity and hierarchy. To avoid this unwanted implication, we introduce the notion of dimensions.
The following dialogue between speakers A and B illustrates our point on concurrent validity:


(1) A: George collects coins. 
(2) B: No he doesn’t. 
(3) A: He has examples of every kind of U.S. dollar coin. 


Statement (2) purports to be a correction of statement (1). Statement (3) is an elaboration of
statement (1), but it also is counterevidence against statement (2).¹⁵ Thus, statement (3) is concurrently
in two different relations to two different statements.
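
The concurrency just described can be written down as a list of labeled relations between statements. The triple format and the helper function in this sketch are assumptions made for illustration; only the three relations themselves come from the dialogue above.

```python
from collections import defaultdict

# (source statement, relation, target statement), from the coin dialogue.
RELATIONS = [
    (2, "correction", 1),
    (3, "elaboration", 1),
    (3, "counterevidence", 2),
]


def relations_by_source(relations):
    """Group (relation, target) pairs under the statement they start from."""
    grouped = defaultdict(list)
    for source, relation, target in relations:
        grouped[source].append((relation, target))
    return dict(grouped)


# Statements that stand in more than one relation at the same time.
concurrent = {
    source: pairs
    for source, pairs in relations_by_source(RELATIONS).items()
    if len(pairs) > 1
}
print(concurrent)   # {3: [('elaboration', 1), ('counterevidence', 2)]}
```

Nothing in this representation forces a statement to choose one relation, which is the property that the term "dimensions" is meant to preserve.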

Four dimensions have been proposed for carrying out discourse analysis. Our preferred
terminology for them is as follows:


1. Informational.¹⁶ Relations that convey information advancing the discourse (example:
elaboration).

2. Intentional.¹⁷ Relations that describe attempts to alter behavior, beliefs, feelings
(example: motivation).

3. Textual.¹⁸ Relations involving adjacencies that are simply imposed by the presentation
medium (example: presentational sequence).¹⁹

4. Exchange.²⁰ Relations in dialogue that involve turn-taking stimulus and response
(example: question-answer).


There are situations in which all four dimensions are relevant. Consider this dialogue: 


(4) A: Why doesn’t the engine work?
(5) B: You need to buy a new fuel filter, and you need clean gas. 


Discourse analysts would all agree that we have three discourse units here: one clause uttered by 
speaker A and two clauses uttered by speaker B. 

The Standard Analysis. The exchange relation between A and B is question-answer (B
answers A's question). There is a conjunction textual relation between the two clauses uttered by B
(the ordering of the clauses in B's response could be reversed, but both are required to answer the
question fully). The intentional relation between A and B is motivation (B wants A to purchase
the items needed to repair the engine). The informational relation between A and B is (indirect)
cause (apparently, dirty gas has fouled the fuel filter; these factors together cause the engine not to
function).


15. In this subsection, the names of discourse relations are underlined (e.g., justify). 

16. Also called semantic, subject-matter, and ideational. 

17. Also called presentational and interpersonal. 

18. Also called presentational. This is similar to what some refer to as schema. 

19. This numbered list (informational, intentional, textual, exchange) exhibits an arbitrary presentational sequence. 

20. We use exchange rather than the usual adjacency pairs, since the related segments may be neither adjacent nor 
pairs. 
What about categories? The foregoing analysis makes no attempt to distinguish between
categories of discourse units and the relations among them. All of the analytical weight is borne by
relations. In clause-level syntax, we identify the classes that entities belong to, and we then
investigate the relations that can hold among them. Must discourse-level syntax be different?

Consider the exchange dimension. In their analysis of the exchange dimension (which they
term adjacency pairs), Stent and Allen distinguish 11 different relations: greeting-response,
summons-response, question-answer, question-no answer, assertion-acknowledge, assertion-modify,
assertion-reject, assertion-hold, proposal-accept, proposal-modify, and proposal-reject.²¹ This list is
woefully incomplete. For example, one might well observe greeting-no answer (“How are you?”
[silence]), greeting-modify (“Hello, Bob.” “Bill, actually.”), or greeting-reject (“Good to see you.”
“Who you kidding?”), and so on. By our analysis, 30 relations are possible.

Each of Stent's and Allen's relations can be viewed as consisting of 1 of 5 stimulus acts
hyphenated with 1 of 7 response acts. If we collapse response into answer and replace no answer
with silence in the list of responses, then we have 5 stimuli and 6 responses, yielding 30 possible
exchange (i.e., adjacency pair) relations. It is more parsimonious to distinguish 5 stimulus and 6
response exchange “parts of discourse” than to have 30 exchange relations. This leads one to ask
if the category of the exchange might not be viewable as one feature (partially) defining a part of
discourse analogous to a part of speech (POS) in clause-level syntax.

Or, consider the intention dimension. In “The Standard Analysis" above, we asserted that the 
intentional relation between A and B is motivate. But we might say that the intention feature on 
part of discourse (5) has the value motivate. In general, our preference is to assign dimensional 
features to parts of discourse rather than to identify the relations between them. As in our approach 
to clause syntax, we adopt a constituent perspective rather than a dependency perspective. 

Modern syntacticians view grammatical classes as bundles of features. “Treating categories as
bundles of features makes it possible to represent large numbers of grammatical categories quite
compactly, since every different combination of features and values is a different category.”²²
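
A short sketch may make the feature-bundle view concrete for parts of discourse. It models a category as an unordered set of feature-value pairs, assigning dimensional features to units (4) and (5) of the engine dialogue. The function name and the small value inventories are assumptions for illustration only, not a proposed feature system.

```python
from itertools import product


def category(**features):
    """Model a category as an unordered bundle of feature-value pairs."""
    return frozenset(features.items())


# Parts of discourse (4) and (5) from the engine dialogue.
unit_4 = category(exchange="question")
unit_5 = category(exchange="answer", intention="motivate")

# The order in which features are listed does not matter.
assert unit_5 == category(intention="motivate", exchange="answer")
assert unit_4 != unit_5

# Every combination of values is a distinct category, so a few features
# cover many categories. These value inventories are illustrative only.
VALUES = {
    "exchange": ("question", "answer"),
    "intention": ("motivate", "unspecified"),
}
bundles = {frozenset(zip(VALUES, combo)) for combo in product(*VALUES.values())}
print(len(bundles))   # 4
```

Adding a value to one feature multiplies the number of available categories without requiring any new relation names.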


21.1.2.6 Representation(s) 
Several methods of representing discourses have been used. Shorn of their particularities, they 
fall into these four major types: 


1. Text descriptions, exemplified by the paragraph “The Standard Analysis” above.

2. Trees (indented lists, labeled trees), introduced in §1.3.2.

3. Labeled graphs, taken up in chap. 20.

4. Discourse representation structures, “an intermediate level of semantic representation . . . ,
derived by an algorithm from the syntactic structure of sentences.”²³ This method is quite
beyond the scope of the present survey.²⁴


21. Amanda J. Stent and James F. Allen, Annotating Argumentation Acts in Spoken Dialog, 27 (Technical Report


21.1.2.7 Analytic Procedures
Analysts rely on one of the following four procedures: 


1. “expertise” (decided by a human expert) 

2. directives from annotation manual (decided by an analyst following a protocol) 

3. syntactic, semantic, and pragmatic rules (via hard logic, heuristically weighted evidence, or 
probabilistic discrimination) 

4. evidence fusion across multiple knowledge bases 


Early discourse analysis relied on the intuitions of its practitioners. Seeking predictability,
analysts then wrote and used analytic protocols. Some have sought to remove the human analyst from the
process by defining sets of objective rules and devising schemes for combining evidence. Others have
relied on computational manipulation of multiple knowledge bases.


21.1.2.8 Rule and Knowledge Bases

Evidence for analysis is extracted from several kinds of sources:


1. clausal syntactic and semantic information
2. semantic networks
3. cognitive models
4. world knowledge


Traditional syntactic and semantic information has been exploited to make robust analyses. For
tracking discourse lexical cohesion, “lexical reference resources” have proved very useful.²⁵ In an
effort to infer information regarding intentions, various analysts have exploited cognitive models.
When nothing else has sufficed, they have resorted to world knowledge.


21.1.2.9 Summary Matrix 

Each of the foregoing eight characteristics of approaches to discourse analysis can take on one
or more values out of three or four available values. Therefore, for any given approach to DA, a
summary matrix can be provided, an unspecified version of which is shown below.
If an approach exploits a particular item, then this item will receive a check mark in the descriptive
DA matrix (see the table “Unspecified Discourse Analysis Matrix”). The matrix includes two
additional specifications in its leftmost column:


1. If the approach relies on punctuation in defining its basic analytical units, then the 
“punctuation feature” (first column, fourth row) will be set to +punct; if punctuation is not 
relied upon, then the feature value will be set to —punct. 

2. If the approach allows multiple dimensions to be concurrently active, then the “concurrency 
feature” (first column, fifth row) will be set to +conc; if concurrency is not allowed, then 
the feature will be set to —conc. 
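
The summary matrix lends itself to a simple data-structure sketch: eight aspects, each with a fixed inventory of values that can be checked, plus the two binary features. The class and method names below are hypothetical, and the approach filled in at the end is made up for illustration. The aspect and value labels follow the lists given in the preceding subsections.

```python
ASPECTS = {
    "interface with syntax": ("narrow-focus", "disjoint", "unified"),
    "text types analyzed": ("exposition", "narration", "indirect speech", "dialogue"),
    "primary foci": ("information structure", "discourse cohesion",
                     "discourse structure", "discourse semantics"),
    "basic units": ("clauses incl. cue phrases", "clauses excl. cue phrases",
                    "certain phrases", "certain fragments"),
    "dimensions of discourse": ("informational", "intentional", "textual", "exchange"),
    "representation(s)": ("text", "tree", "graph", "DRS"),
    "procedures": ("expertise", "annotation manual", "rules and logic",
                   "evidence fusion"),
    "rule and evidence bases": ("clausal information", "semantic networks",
                                "cognitive models", "world knowledge"),
}


class DAMatrix:
    """Check marks for one approach to discourse analysis (hypothetical sketch)."""

    def __init__(self, punct=False, conc=False):
        self.checked = {aspect: set() for aspect in ASPECTS}
        self.punct = punct   # relies on punctuation to define basic units
        self.conc = conc     # allows concurrently active dimensions

    def check(self, aspect, *values):
        """Mark values as exploited, rejecting labels outside the inventory."""
        unknown = set(values) - set(ASPECTS[aspect])
        if unknown:
            raise ValueError(f"unknown value(s) for {aspect}: {sorted(unknown)}")
        self.checked[aspect].update(values)

    def flags(self):
        return ("+punct" if self.punct else "-punct",
                "+conc" if self.conc else "-conc")


# A made-up approach, for illustration only.
example = DAMatrix(punct=True, conc=False)
example.check("interface with syntax", "disjoint")
example.check("representation(s)", "text", "tree")
print(example.flags())   # ('+punct', '-conc')
```

Storing the check marks as sets per aspect makes it easy to compare two approaches aspect by aspect.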


21.1.2.10 Ideal Characteristics of Discourse Analysis of the Hebrew Bible (DAHB) 
We populate a DA matrix with the constellation of check marks that corresponds to ideal
discourse analysis for the Hebrew Bible (DAHB), as we envision it (see the table “Andersen-Forbes
Proposed DAHB Matrix”).


25. Christiane Fellbaum, ed., WordNet: An Electronic Lexical Database (Cambridge, MA: MIT Press, 1998) 7. 
Unspecified Discourse Analysis Matrix

Interface with Syntax           | Narrow-Focus           | Disjoint               | Unified             |
Text Types Analyzed             | Exposition             | Narration              | Indirect Speech     | Dialogue
Primary Foci                    | Information Structure  | Discourse Cohesion     | Discourse Structure | Discourse Semantics
Basic Units (+punct)            | Clauses incl. Cue Phr. | Clauses excl. Cue Phr. | Certain Phrases     | Certain Fragments
Dimensions of Discourse (+conc) | Informational          | Intentional            | Textual             | Exchange
Representation(s)               | Text                   | Tree                   | Graph               | DRSᵃ
Procedures                      | “Expertise”            | Annotation Manual      | Rules and Logic     | Evidence Fusion
Rule and Evidence Bases         | Clausal Information    | Semantic Networks      | Cognitive Models    | World Knowledge

a. DRS = Discourse Representation Structure.


Andersen-Forbes Proposed DAHB Matrix (✓ marks an item that the approach exploits)

Interface with Syntax           | Narrow-Focus            | Disjoint                 | Unified ✓             |
Text Types Analyzed             | Exposition ✓            | Narration ✓              | Indirect Speech ✓     | Dialogue ✓
Primary Foci                    | Information Structure ✓ | Discourse Cohesion ✓     | Discourse Structure ✓ | Discourse Semantics ✓
Basic Units (−punct)            | Clauses incl. Cue Phr.  | Clauses excl. Cue Phr. ✓ | Certain Phrases ✓     | Certain Fragments ✓
Dimensions of Discourse (+conc) | Informational ✓         | Intentional ✓            | Textual ✓             | Exchange ✓
Representation(s)               | Text                    | Tree                     | Graph ✓               | DRS ✓
Procedures                      | “Expertise”             | Annotation Manual        | Rules and Logic ✓     | Evidence Fusion ✓
Rule and Evidence Bases         | Clausal Information ✓   | Semantic Networks ✓      | Cognitive Models ✓    | World Knowledge ✓


The matrix discloses that our approach to discourse analysis: 


1. seeks a unified interface with syntax, 
2. analyzes all four major text types, 
3. focuses on the four traditional foci (downplaying information structure to some extent), 
4. does not rely on punctuation clues,

5. considers clauses (without cue phrases) to be the basic units (with certain phrases and
certain fragments as basic units as well),

6. uses all four of the dimensions of discourse that we have identified (allowing concurrent
realization),

7. uses graphs and the DRS for representation,

8. relies on rules and logic as well as evidence fusion, and

9. extracts evidence (to varying extents) from each of the four identified sources.


21.2 Our Phrase Markers as a "Work in Progress" 
Our phrase markers are incomplete in these respects: 


1. Assembly and labeling of discourse structures have scarcely begun. 

2. Some of our analytical categories have not yet been propagated through the database. 
3. The representation is mixed (see §9.2.1).

4. Constituent features are insufficiently propagated up through the phrase markers. 

5. Errors and inconsistencies in the phrase markers are continually being corrected. 


21.2.1 Provisional Representation of Discourse Form and Function 


21.2.1.1 The Sancta Clause Parade 

Our original intention was for our phrase markers to consist only of main clauses, saving
indication of all discourse structures for much later. Based on this plan, the phrase marker for Sisera's
speech in Judg 4:20 would consist of seven or eight isolated clauses (depending on how we handled
והיה).

21.2.1.2 Gathered Speeches 

In practice, the resulting display of isolated clauses was disconcertingly “jerky.” And so, we
tried simply binding embedded speeches together using dashed edges. Based on this plan, the phrase
marker for Sisera's speech looks like (21.1) (see p. 320).²⁶ In this display, each clause is identified,
and the speech is gathered under one obj addr node in the phrase marker. The dashed edges indicate
that the analysis is provisional. The phrase marker does not show how the clauses are related. For
example, the presence of the cue phrase אם ‘if’ implies that one of the two subsequent ו-segments
is likely a “waw of apodosis.”²⁷ But the phrase marker shown does not indicate which ו is the “waw
of apodosis.”

21.2.1.3 Skeletal Indication of Discourse Structure 

In light of our preliminary work on discourse, we have decided at this stage to indicate how 
some clauses interrelate with regard to the form of the discourse. Much less frequently, we represent 


26. This display can be obtained with the Logos Research Systems program by bringing up our phrase markers,
selecting the phrase marker window, clicking on Display, and then deselecting “Supra-Clausal Structures.”

27. That is, one of the ו-segments is the cue phrase indicating the onset of the apodosis (the “then-unit”)
corresponding to the protasis (the “if-unit”) cued by the אם. See P. Joüon and T. Muraoka, A Grammar of Biblical Hebrew
(rev. Eng. ed.; Rome: Pontifical Biblical Institute, 2006) 607-10.
(21.1) [Phrase marker for Judg 4:20, with the clauses of the speech gathered under an obj addr node.
Glosses: "thou(m) stand", "door of", "the", "tent", "and", "he will be", "man", "he will come",
"he will ask", "thee(f)", "he will say", "exists", "here", "man", "and", "thou(f) wilt say",
"does not exist".]



discourse function. For Judg 4:20, the interim phrase marker²⁸ is (21.2) (see p. 321). We make
four observations regarding the structure of this phrase marker:


1. The three main clauses making up the protasis are gathered into a (supra-clausal) sentence
licensed by the union relation. The sent/union/disj node specifies the form of the discourse
unit. This node is dominated by a cond/cue node which tells the function of the discourse
unit (cond = condition = protasis) and dominates both the sentence and cue phrase, אם ‘if’.

2. The segment sequence והיה is dominated by a cue/cue node. The two segments together
form a compound cue phrase, referred to variously as a text-deictic or macro-syntactic
sign.²⁹ Regarding compound cue phrases, see §21.3.2.3.


28. Strictly speaking, a phrase marker discloses the structural representation of a clause consisting of phrases.
What we have here might more precisely be termed a clause marker since it discloses, albeit in incomplete form,
the structure of a discourse consisting of clauses. In the interests of simplicity, for now we will retain the less precise
terminology.

29. Cue phrase: C. H. J. van der Merwe, J. A. Naudé, and J. H. Kroeze, A Biblical Hebrew Reference Grammar
(Sheffield: Sheffield Academic Press, 1999) 331; text-deictic: Bruce K. Waltke and M. O'Connor, IBHS, 54 n. 24, 634;
macro-syntactic sign: M. O'Connor, “Discourse Linguistics and the Study of Biblical Hebrew,” in Congress Volume:
Basel, 2001 (VTSup 92; Leiden: Brill, 2002) 18.
(21.2) [Phrase marker for Judg 4:20 with skeletal discourse structure. Surviving node labels: cue,
sent, cond, cl, sbj.]

3. The apodosis is labeled as being a result licensed by the cue phrase ו. Other discourse units
and relations in this complex verse are not yet specified.

4. In addition to the use of dashed edges, the provisionality of the present analysis is shown
by discourse units that are often labeled with the indeterminate labels unit/cue and
unit/no cue.

21.2.2 Partially Installed Categories 


In our glossary, an obelisk (†) identifies these categories that have not yet been completely
marked:


condition
cue phrase
discourse unit
exocentric absolute
janus sb/sc
material
ruled-over one
separation


There are also five incompletely assigned licensing relations: 


